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Preface 


This quarterly publication provides archival reports on developments in 
programs managed by JPL’s Office of Telecommunications and Data Acquisition 
(TDA). In space communications, radio navigation, radio science, and ground-based 
radio and radar astronomy, it reports on activities of the Deep Space Network (DSN) 
and its associated Ground Communications Facility (GCF) in planning, in support- 
ing research and technology, in implementation, and in operations. Also included 
is TDA-funded activity at JPL on data and information systems and reimbursable 
DSN work performed for other space agencies through NASA. The preceding work 
is all performed for NASA’s Office of Space Operations (OSO). The TDA Office 
also performs work funded by two other NASA program offices through and with 
the cooperation of the Office of Space Operations. These are the Orbital Debris 
Radar Program (with the Office of Space Station) and 21st Century Communication 
Studies (with the Office of Exploration). 

In the search for extraterrestrial intelligence (SETI), the TDA Progress Report 
reports on implementation and operations for searching the microwave spectrum. In 
solar system radar, it reports on the uses of the Goldstone Solar System Radar for 
scientific exploration of the planets, their rings and satellites, asteroids, and comets. 
In radio astronomy, the areas of support include spectroscopy, very long baseline 
interferometry, and astrometry. These three programs are performed for NASA’s 
Office of Space Science and Applications (OSSA), with support by the Office of 
Space Operations for the station support time. 

Finally, tasks funded under the JPL Director’s Discretionary Fund and the 
Caltech President’s Fund which involve the TDA Office are included. 

This and each succeeding issue of the TDA Progress Report will present mate- 
rial in some, but not necessarily all, of the following categories: 

OSO Tasks: 

DSN Advanced Systems 

TYacking and Ground-Based Navigation 
Communications, Spacecraft-Ground 
Station Control and System Technology 
Network Data Processing and Productivity 
DSN Systems Implementation 
Capabilities for Existing Projects 
Capabilities for New Projects 
New Initiatives 

Network Upgrade and Sustaining 
DSN Operations 

Network Operations and Operations Support 
Mission Interface and Support 
TDA Program Management and Analysis 
Communications Implementation and Operations 
Data and Information Systems 
Flight-Ground Advanced Engineering 

OSO Cooperative Tasks: 

Orbital Debris Radar Program 
21st Century Communication Studies 
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Search for Extraterrestrial Intelligence 
Goldstone Solar System Radar 
Radio Astronomy 
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An Analytic Development of Orbit Determination 
for a Distant, Planetary Orbiter 

R. K. Russell and S. W. Thurman 
Navigation Systems Section 


With the advent of the Mariner ’71 Mission, NASA has been sending spacecraft 
to orbit various distant bodies within the solar system. At present , there is still no 
adequate theory describing the inherent state estimation accuracy, based on two- 
way, coherent range-rate data. It is the purpose of this article to lay the groundwork 
for a general elliptic theory, and in addition to provide an analytic solution for the 
special case of circular orbits. 


I. Introduction 

When one begins to analyze a possible future mission 
employing an orbiting spacecraft, it is desirable to know 
the characteristics of the position and velocity errors of 
that spacecraft, based on a single revolution of range-rate 
data. To know these errors as a function of orbit shape, 
size, and orientation with respect to the Earth is the cen- 
tral issue. If range were the data type of interest, the prob- 
lem would be vastly simpler, since velocities would not be 
involved. Unfortunately for the analyst, however, range- 
rate data is central and must be treated. This particular 
problem has proved to be a most formidable analytic en- 
deavor. 


II. Development of the Observable 

One of the most important aspects of this orbital es- 
timation problem is to pose it properly. In this regard, the 
“binary star problem” of classical astronomy can provide 
insight. In attempting to obtain the orbits of a distant bi- 
nary star system through the use of spectroscopic Doppler 


shifts, astronomers discovered that they could determine 
all the elements of the motion except one. The one ele- 
ment, which turned out to be completely indeterminate, is 
the longitude of the ascending node relative to the plane- 
of-the-sky, with the plane-of-the-sky being that plane nor- 
mal to the line-of-sight from the observer to the binary 
system. 

The reason for this indeterminacy is that the binary 
star system is effectively “at infinity,” meaning that no 
significant parallactic motion exists between the observer 
and the observed. 

In this problem, the body about which the spacecraft 
is in orbit is not “at infinity,” but is relatively near. This 
implies that there will be some parallactic motion between 
the observer and the observed, and hence we should be 
able to obtain a “reasonable” solution for the longitude of 
the ascending node relative to the plane-of-the-sky. This 
parameter will typically be the principal contributor to the 
spacecraft state error. As such, classical elements relative 
to the plane-of-the-sky appear to be a sensible “eigen” 
system in which to solve the problem. 
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The geometry of a spacecraft in orbit about a distant body is depicted in Fig. 1. Let: 






( 1 ) 

( 2 ) 


where x p , y p , and z p are the plane-of-the-sky Cartesian position coordinates of the spacecraft and x p , y p , and z p are the 
plane-of-the-sky Cartesian velocity coordinates of the spacecraft. 

The orbital coordinates are then defined as: 



with 


£ = a(cos E — e) 
r] — a/3 sin E 
< = 0 


( 3 ) 

( 4 ) 

( 5 ) 

( 6 ) 
( 7 ) 
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n = 
C = 
0 = 



an sin E 
— e cos E 
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( an/3 cos E \ 
1 — e cos E ) 

0 




( 8 ) 

( 9 ) 

( 10 ) 

(ii) 


where E is the eccentric anomaly, a is the orbit semi-major axis (km), e is the orbit eccentricity (dimensionless), and 
n is the orbit mean motion (s -1 ). 

The orbital coordinates can be related to the Cartesian, plane-of-the-sky coordinates as follows: 


r p = R T (Sl,L>,i)r 0 


( 12 ) 


where f? T (fi,w,i) is the transpose of the Euler rotation matrix, defined as 
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sin i sin w \ 


R{Q,,<jj y i) 


/ cos £2 cos u — sin £2 cos i sin cj sin £2 cos u> + cos ft cos i sin u) 

— cos ft sin u> — sin ft cos i cos u> — sin 12 sin u) + cos 12 cos i cos u; sin i cos a; I 
\ sin 12 sin i — cos 12 sin i cos i ) 


(13) 


and 12 is the longitude of ascending node relative to the plane-of-the-sky, w is the argument of periapsis relative to the 
plane-of-the-sky, and i is the inclination relative to the plane-of-the-sky. 

To have any hope of analytically solving this problem, all possible simplifications must be made without, of course, 
loss of generality. One such simplification will be to define the x-axis of the plane-of-the-sky system to be the line-of- 
nodes. This implies that 


£2 = 0 


(14) 


Thus R t ( 12,0), i ) becomes 




/ cos u> — sin u? 

sin cj cos i cos u> cos i 
\ sin LJ sin i cos c j sin i 


0 \ 
— sin i 

cos i / 


If it is now assumed that the orbital elements are constant in time, 


(15) 


r p = R T (u), i)r 0 (16) 

From Fig. 1, it is seen that 

r d = r p — r T (17) 

( Vjyt COS 9 \ 

Vjsft sin 6 (18) 

VRt + pet / 

where: 

Vs is the component of the velocity of the target central body relative to the Earth that lies in the plane-of-the-sky; 
this shall hereinafter be referred to as the “cross-velocity” 

Vr is the component of the velocity of the target central body relative to the Earth that lies along the line-of-sight 

Pet is the distance along the line-of-sight between the center of the Earth and the center of the target central body 

at the initial epoch of estimation, i.e., at t = 0 

9 is the angle between the cross-velocity and the line-of-nodes 
One can now form the range-rate observable, p , as 
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where 


p = M 


( 20 ) 


Therefore, 


where 


It can then be shown that 


pp = r p * r p - r p fr - ry • r p + r T • r T 


ir = 


( Vjy cos 0 
V/v sin 6 
V R 


t t • r T = 4- 4- VrPet 

r p ip — a 2 ne sin E 

r p • tt = Vat (^ ori 4- ^ 2 ) 4- Vr(£ sinw 4- 77 cos w) sin i 

r T -r P = Vtft((ai 4- »)of 2 ) 4- (Vh< 4- />£t)(£ sinu> 4- r)cosu>) sin i 

where 

ai = cos 0 cos w 4- sin 0 sinw cos i 
(*2 = — cos 0 sin u> 4- sin 0 cos u cos i 

Next one has 

P = (r d ■ r d y = [r p • r p + r T • r T - 2r p • r T ] 3 

This can be shown to be 

p — |a 2 ^1 - e cos E^j + p% T + 2 petVrI + + V^jt 2 

— 2V/v<(^ai + r?a 2 ) - 2 (Vr* + Pet)(Z sinw + » 7 Cosu>)smi 


( 21 ) 


( 22 ) 


(23) 

(24) 

(25) 

(26) 

(27) 

(28) 


(29) 


(30) 


For large pet this implies 


1 

P 


1 

PET 


[l + 2 ™- 

2 sin i f 

£ sinu; 4* ttcosu;} 4* o( 

Pet 

Pet t 

J 'Pet' J 


( 31 ) 


This then becomes 


1 1 

- « 

P Pet 


i _ Y*L + sint 

PET PET 


sin u + r) cos u) j* j 


Combining Eqs. (21), (23)— (26) , and (32) yields 


p « — — IpExViR — Pet\£ sinu> 4- ») cost*;} sin i + Vj$t + o^ne sin E 
Pet L l J 

~Vjv|(^ + £t)a i -h (77 + f}t)ot2^ ~ sinu; + fjcosu; j|£ sin a; + tjcosu; j sin 2 i 


(32) 


(33) 


From the above, one can observe that 


|^sinu; + r/cosc*;} = a|(cosi£ — e)sinu; + /?sin E cosu;| 

sinu; 4* 77 cos u>j = — e~s~£?) {*- sin i? sinu; 4 (3 cos E cos a; | 


(34) 

(35) 


Combining Eqs. (33)-(35) produces 


( an sin ^ \ f . _ . „ _ 1 

Vr — - — { — sin E sin u 4 p cos E cos u > 

\ 1 — e cos E J 1 J 

H — { V£t — ( SSL* ) {sin E sinu; — /?cos£ , cosu;}{(cos E — e)sinw 4/? sin E cosu>} 

PET { \l - ezosE J l J l J 

+ a 2 ne sin i? - Vv{(f + £<) a i + (*1 + jj 


(36) 


Now one proceeds to expand the last term of Eq. (36). To do this, Kepler’s Equation is needed: 


E — e sin E = n(t — Tp) 


(37) 


where Tp is the time of periapsis passage. 
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Without loss of generality, one may assume that Tp = 0. Therefore 


nt = E — e sin E 


( 38 ) 


Combining Eqs. (5), (6), (8), (9), and (38) produces 


{(£ + fO«i +(*? + ^)a 2 } 

r / „ (i? — esin#) sin ./ . _ (£ — e sin E) cos E \ x 

= a\a l lcosE-e-± } — +a 2 [sinE+i } — \ 

l\ 1 — e cos E / V 1 — e cos E ) > 


(39) 


The range-rate approximation then becomes 


p & \ Vr — ( qns ? n * j f gin ^sinw + (3 cos E cos w } -+ — { V^t + a 2 ne sin E 

[ \ 1 — e cos E J l J pet L 

— ns * n |sin j£sinu> — (3 cos E cosu; j|(cos E — e) sinu; 4- /?sin E cosu;| 


- Vjva{«i(' 


cos E — e — 


(E — esin 2?) si nE 
1 — e cos E 


) + c* 2 /? (si 


sin E + 


(£ — esin £■) cos# 
1 — e cos E 


)}}] 


(40) 


If one wished to compute an approximation to the actual received range-rate signal, ignoring the motion of the 
receiving station relative to the center of the Earth, Eq. (40) would be the expression to use. In this case, however, the 
range-rate approximation shall be used for the development of partial derivatives only, and the only partial derivatives of 
interest are with respect to the spacecraft state (classical orbital elements with respect to the plane-of-the-sky). Hence, 
the elements of interest are a, e, Tp, i, and the final element is 0. Note that 0 by our definition is equivalent to the 
longitude of the ascending node, Cl. 

Thus those terms in the range-rate approximation which do not contribute to the appropriate partial derivatives 
will hereinafter be ignored. Additionally, those terms which contribute negligible partial derivatives will also be ignored. 
As a result, the range-rate approximation which shall be used for the computation of the partial derivatives is 


/ ansint \ f . n . ^ ^ 1 

P « “ — s sin E sin u — p cos E cos u > 

\ 1 — e cos E J ^ > 


-(£){*■( 


cos E — e — 


(E — esin £')sin 


1 — e cos E 


+ot 2 0 (si 


sin E + 


(E — e sin E) cos E 
1 — e cos E 


)}] 


(41) 


The partial derivatives with respect to a, e, Tp, u, and i shall be computed from the first term in the range-rate 
approximation (since these dominate partials for the same parameters taken from the second term), whereas the partial 
derivative with respect to 6 will be computed from the second term (this is the only term with 6 variation). 
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III. Obtaining the Partial Derivatives 

Rewriting Eq. (41) produces 


r _i 1 . 
a 3/x 2 sin 

1 — e cos E 


7 j |sin E sin u — 0 cos £* cos 


_ f 

VPET/ \ 


(£ — e sin £*) sin £\ . / . _ (E — e sin E) cos E 

<*1 ( cos E — e — - ( sin E + - } — 

1 1 - e cos E ) \ 1 - e cos E 


)}1 


(42) 


To produce the partial of the observables with respect to the classical state of the spacecraft, the partials of the eccentric 
anomaly, E } with respect to the classical state are needed. They are 



3 (E — esin E) 
2a(l — e cos E) 

sin E 

(1 — ecos E) 
n 

(1 — ecos E) 


= 0 
= 0 
= 0 


Taking partial derivatives of the observable produces 




{— sin E smu + /?cos jEcosu;} 
(1 — e cos E) 


3 (E — esin E) {(cos E - e) sin a; + /? sin E cos uj] 
(1 — ecos E) 3 



an sin i 


— cos E {(cos E — e)cosu — /?sin 2£sinu;} 
(3(1 - ecos E) 2 


+ 


sin E {(cos E — e)sina; -f /? sin E cos u > } 
(1 — e cos E) 3 


(43) 

(44) 

(45) 

(46) 

(47) 

(48) 


(49) 


(50) 
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(51) 



—an sin % 


(cos E — e) sin u + /? sin E cos u 
(1 — ecos E) 3 


an sin i 


sin E cos lj “I - f3 cos E sin 


(1 — e cos E) 


an cos t 


. [ sin E sin u — (3 cos E cos u> 


(1 — e cos £) 


- (£) M 


cos E — e — 


(E — e sin f?) sin £ 
1 — e cos £ 




sin E + 


(£ — e sin E) cos E 
1 — e cos E 


}] 


(52) 

(53) 

(54) 


where 

(55) 

(56) 


c *3 = — sin 0 cos cj + cos 0 sin cj cos i 
a 4 = sin 0 sin u; + cos 0 cos u; cos i 


Again, if one is to get through this problem, one must simplify as much as possible or risk getting lost in the 
algebra. A careful examination of the partials detailed in Eqs. (49)-(54) reveals that the first term of Eq. (49), other 
than a constant of proportionality, is the same as Eq. (53). Therefore it may be eliminated from Eq. (49) by a change 
of variable. Let 


which implies that 


From Eq. (57), one has 



a + ( 2 a cot i)i 


(g ) = (£) it) + (I) (£) 



Therefore, 


dp\ / — 3nsini\ (£ - e sin E) { (cos E_ - e) sin u + /? sin E cos w) 
aa 7 J - V 2 ) (1-ecos E) 3 


(57) 


(58) 


(59) 

(60) 


(61) 


To obtain the correct partials in Eq. (58), one must carefully discern which of the variables in Eq. (57) are differentiable 
parameters and which are not; it is then clear that Eq. (61) represents a considerable simplification of the problem. 


S 


IV. Accumulating the Information Array 

The normal, Gaussian way by which to obtain the covariance of errors in a set of estimated parameters, T, is stated 
as follows: 


r = [f ~ 1 +a t wa^ 1 


(62) 


where f is the a priori error in the covariance, A is the vector partial derivative of the observable (at the time of 
observation) with respect to the parameters being estimated, and W is the weighting matrix. If it is assumed that the 
error in the measurement of the observable at time t{ is uncorrelated with the error in the measurement of the observable 
at time tj , for all i and j, then W becomes a diagonal matrix of dimension equal to the number of observables. From 
minimum- variance concepts, the diagonal elements of the weighting matrix may be shown to be just the inverse of the 
variance of error in the observable (it is assumed the error process of the observable is stationary and hence that the 
variance of error in the observable is constant and unchanging for every data point). 

If one assumes the existence of no a priori information, then the error covariance sought, assuming discrete obser- 
vations, may be formed as the inverse of the information array: 


r = 



(63) 


where X is the vector of estimated parameters (in this case this is the spacecraft state in classical elements relative to 
the plane-of-the-sky), which shall be ordered in the following way: 


— (pi) P2, P3y P4y Pb, = ( 9 , a 7 , i, e, 7>, 


and N is the number of data points (observations). 

In this case, to simplify the accumulation, continuous observations are assumed, and hence the sums involved will 
become integrals. For further simplicity, the accumulation shall be performed over a full orbit period in the time domain 
P (in the manner in which “real” data is taken, in equal increments of time), beginning at —P/2 and ending at +P/2. 
Since the observable and the partials are all formulated in terms of the eccentric anomaly, E, it is logical to perform the 
accumulation with this parameter as the independent variable. From Eq. (37), one has 


dt = v 1 ~ e cos ~l dE 

n 

Thus a general information array element, /*j, may be stated as 


(64) 


lij 


1 r ,et\ (M\ 

(a<r}) y_ T V dpi ) \ dpj ) n 


(65) 


where a is a scale factor chosen such that the information array, Uj , obtained continuously is the same as that formed 
discretely. 
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To ascertain the value of a, let X consist of only one parameter, with its attendant partial derivative being a 
constant, K. Then the discrete information is just 


/n = 



The continuous information, however, is 


/n = 



( 66 ) 


(67) 


where P is the orbit period (in seconds). 

Therefore, for the two informations to be equal, one must have 

*-(£) (68) 

This, however, implies that 

a = At (69) 


where At is the discrete sample rate (in seconds) . 

In the standard case of two-way, coherent range rate as the orbital data type, the discrete sample rate is 60 seconds. 
Thus the elements of the information array may now be stated using the continuous formulation as 


- _J_ r (°L) (°L) 

l '> ~ (alAt) J.Adpi) \dpjJ 


(l-ecosff) i£ , 


(70) 


For simplicity, scaled information array elements, I-j , shall be stated where 


* ■ (**) 


(71) 


Note that Eq. (71) implies that when the determinant of the full 6x6 information array is computed, to obtain a correct 
value, the result must be multiplied by 
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Similarly for the cofactors of dimension 5x5, the result must be multiplied by 


(^) s 

Since the information array is symmetric, only the upper diagonal portion of the array shall be stated. 

If one examines Eq. (42), it can be seen that the range-rate value obtained for a certain value of u> is the negative of 
that for u + 7r. The partials will also be of opposite signs. The information array elements will, however, be the same. 
Hence the information array must be periodic in u>, but go through two full periods as u goes from zero to 360 degrees. 

With the complicated nature of the partials, there is a need for a very specific table of integrals to perform the 
accumulation. Such a table is provided in the Appendices. With the use of the Appendices, the elements of the 
information array can be straightforwardly computed. 

Due to the complexity of the information array, any further simplification will be most welcome. We are only 
interested in obtaining the variance (standard deviation) of 0, which is the error in the line-of-nodes relative to the 
plane-of-the-sky. Any element in the inverted information array (the covariance), , can be stated as 


_ U-\) i+ iCo D tJ 

11 I D 


(72) 


where D is the determinant of the full 6x6 information array, i 7 , with the appropriate constant factors included, and 
CoDij is the cofactor of the ij th element of the information array, which is just the determinant of the remaining 5x5 
array obtained by removing row i and column j from the full 6x6 information array, with also the appropriate constant 
factors included. 

In this case, we are interested in Cn only. This is just 


Cn — Vg — 


Co Du 
D 


(73) 


If it is assumed that there are removable coefficients, a,-, in each of the ith rows and ith columns of and similar 
coefficients, 6;, coming from the 5x5 matrix whose determinant is the CoD n , then 


Cn — — 


CoD\A 

& y 


UU b l = 1 ( CoD' n \ 

n - =1 «. 2 d > ) 


where Coiy n and D? are the corresponding determinants with common factors removed. 


(74) 


One can now state the information array, i 7 , to compute U and CoD \\ . The information array, with coefficients 
removed, is shown below. 
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( 75 ) 


/;, = [{y + 1 - 2« + 2e ! } («I + o|) 

+ (?) {' #SW + ^ ln { 2^} - J (» - «’> - ? + + < - |<» - w 1 )} (“5 - “I) 


where 


(a^ + a 2 j = sin 2 0 + cos 2 6 cos 2 i 


(76) 


(a 2 — a|^ = 2 sin 2u> sin 6 cos 6 cos i - cos 2 u (sin 2 9 - cos 2 6 cos 2 


1 + /? ^+2e + ±(l- /?)(!- 3/?)} 


7 ' 2 = [{ W‘ S(t) - 1 ln ( 1 + t) - + 0) + (T^) } 

-^{(21n(l + 7) + 31n|^ r ^} + 2e + | 
x (sin 0 cos 2w — cos 0 cos i sin 2 w j | 

l' 13 = 0 sin 0 + j? 2 + ^ ^ ln 1 } } (sin 0 cos 2 w - cos 0 cos i sin 2w ) 





/u = I ( — I < ln 


1 + 0 


(77) 


H 


I' l5 = cos * cosi ' - ( 2 (V+eK ( l 1 + t /^ ) ( s * n ^ sin 2« + cos 0 cos j cos 2w) ] 

/' 6 = ^[ co8<,cos *‘ - (t 2 + ! + ? 111 { 2[n4 }) ( si 


sin 9 sin 2w -f cos 9 cos i cos 2u> J J 


(78) 

(79) 


e/9/ r l2(l + e)} +2 " W + W }“°‘' 

^ |(2 — e 2 ) ln 1 2 (\~+ ) } + - - 0) + 2e - e 4 } (sin0cos2u> - cos0 cos»'sin2u;jJ (80) 


(81) 

(82) 


I'-n = ~ ji !“(! + 7) + £(1 - e) - ^ + 20+ 3)} 


„ fe 2 „.. (2 — 5e 2 ) (l-e)(2 + e 2 ) 1 (1 + /J)(3 - 0)' 

- co82w ( 4^ 5( 7)+ 3 e 2 /3~ n(1 7) + +4p 



(83) 

(84) 

(85) 
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3c)/? 3 


7 ”~ (l2ev) [' + a + e)» 

r “ = [rh-iT^]" ,,2 “ 
/Sa = 1 "(lfl)“* 2 " 


sin 2 cj 


r, - (!-/?) rnn n r , 

34 “ e/?(l + /?) cos2w 


/35 = 0 


736 - (rrl) 


sin 2w 


V = * 

44 4/?3(l + /?) 


2(3+/?)+ o^ cos2w 


7 « - (4^) 


sin 2a; 




iu= 


1 

4/? 5 


2(2 + e 2 ) — e 2 cos 2u> 


r -1 

^56 - 


■^66 = 1 + 


(£0 


cos 2w 


( 86 ) 

(87) 

( 88 ) 

(89) 

(90) 

(91) 

(92) 

(93) 

(94) 

(95) 

(96) 

(97) 


where the removable coefficients, of course, turn out to be the coefficients of the individual partial derivatives, which are 


£ 

11 

0 

(98) 

V Pet / 

( — 3n sin *\ 

(99) 

— ( 2 ) 

03 = an cos i 

(100) 

a 4 = an sin i 

(101) 

a 5 = —an 2 sin i 

(102) 

ae = an sin i 

(103) 
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From these scale factors and Eq. (71), one can infer that the error in 9 may be written as 


<r e = <Tp 



Pet(p)H A<)’ 

/(e,w) 

a p 

. V«r(a)i0 0* . 

.g(0,i,e,u)' 


( 104 ) 


where /(e,o;) is the cofactor of I? n , and g(9, *,e,o;) is just the determinant, D* . 

V. Inverting the Information Array 

Since the interest is in a single element of the inversion, only those computations required in Eq. (74) shall be 
performed. Before plunging into the general elliptic case, the inversion for the special case of the circular orbit (i.e., 
e = 0) shall be demonstrated. 

A. Circular Case 

After much work, the elements of the information array (in the limit as e tends to zero) can be shown to be: 


1 12 


1 22 


*23 


/ 24 


*25 


*26 



(105) 

7 r 2 . 

= y sm« 

(106) 

= sin 9 -f ^ ^sin 9 cos 2a ; — cos 9 cos i sin 2a; j 

(107) 

= sin 9 — i ^sin 9 cos 2a; — cos 0 cos i sin 2a;^ 

(108) 

= cos 9 cos i — i ^sin 9 sin 2a> + cos 9 cos i cos 2a; ^ 

(109) 

= cos 9 cos i — i ^sin 9 sin 2a; + cos 0 cos i cos 2a; ^ 

(110) 

7T 2 1 

= — — - cos 2a; 
3 2 

(111) 

1 

= - cos 2a; 

(112) 

= 1 — - cos 2a> 
o 

(113) 

— — i sin 2a; 

(114) 

= — ^ sin 2a; 

(115) 

~ ^44 = ^55 = ^56 “ ^66 “ 1 

(116) 

= /85 = *36 - '45 = *46 = 0 

(117) 
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To perform the inversion in this special, circular case, it is necessary to remove the parameter u> from the estimation 
since the argument of periapsis is indeterminate. Additionally the parameter Tp is removed from the estimation since 
the partials are large and the same as those of w, only with the opposite sign. This rank reduction corresponds to 
assuming that the true anomaly of the spacecraft at epoch is perfectly known, which is realistic in comparison to the 
error in 9. With a large amount of algebraic manipulation, the four-parameter inversion yields an error in 9 of 


where 



Pet{p)*( A<)* 

/(«) 

(Tp 

V N (a)i(r 0* . 

g(9,i,ui). 


/(“ 0 = 


7T 2 , 1 „ 13 j 

1 + - cos 2w - — cos* 2w 

3 6 36 


(118) 


(119) 


,(* , i,„) = (2d£?di) [„< - ,» cos 2* - cos 5 V - £ + f cos 2* - f cos 5 2u.] 


( 120 ) 


It is now apparent from Eqs. (118) — (120) that the use of range-rate data for state estimation of a circular orbit about a 
distant planet can yield singular results. Singular state errors will result if 


9 = 90 deg, 270 deg, and/or 

(121) 

i = -90 deg, 90 deg, and/or 

(122) 

O 

II 

£ 

(123) 


This latter condition of singularity may occur when the spacecraft is orbiting a superior planet (at the beginning and end 
points of retrograde motion). These are important results, which are significantly at variance with orbit determination 
experience for highly eccentric trajectories, which appear well behaved and certainly not singular. Note that even with 
zero eccentricity, the results of Eqs. (119) and (120) imply a functional dependence upon u. This is an artifact of the 
definition of the coordinate system at time zero. If we assume w = 90 deg, then the maximum variation in the error 
in 9 for any other value of u can be shown to be 13.6 percent. Thus the error in 9 can be said to be only weakly 
dependent upon w. 


B. Elliptic Case 

Many thanks go to R. J. Muellerschoen of the Navigation Systems Section for his help in using MATHEMATICA 
to attempt the 6x6 inversion. Even with a large amount of available memory, the problem quickly consumed it all and 
more. Thus it appears unlikely that a “reasonable” functionality for / and g can be obtained in the general elliptic case. 
However, in having the information array, we have performed the accumulation in a general manner which will save 
much time. A large number of orbit-determination studies as a function of geometry relative to the plane-of-the-sky can 
be made quickly and cheaply. 
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VI. Results 

A. Circular Case 


Venus Orbiting Imaging Radar (VOIR), the progenitor of Magellan, was extensively studied many years ago. It 
was in these studies that the circular orbit singularities were first observed. The nominal orbit just happened to have 
conditions that were near-singular. As a result, orbit determination errors were obtained that were outrageously large 
(larger than the nominal orbit). This led to the use of several different computer programs to examine whether there were 
program errors or not. The two main programs used to investigate the state estimation were the Orbit Determination 
Program (ODP) and the Sequential Orbit Determination (SOD) Program. Both of these pieces of software yielded 
the same, poor orbit determination behavior and so confirmed the belief that we had inadvertently stumbled upon 
singularities in the orbit determination process for circular orbiters. The important parameters defining the nominal 
VOIR orbit were: 


Epoch 

Semi-major axis, a 
Eccentricity, e 
Inclination, i 
Argument of periapsis, u> 
Cross-velocity, Vn 
Target range, p E r 
Angle, 6 
Period, P 


March 17, 1982 
6552 km 
1.52 x 10' 4 

78.17 deg (relative to the plane-of-the-sky) 

170.7 deg (relative to the plane-of-the-sky) 

14.413 km/sec 

8.36765 x 10 7 km 

85 deg 

1.624 hr 


The computer solutions were assumed to have as a data base one full orbit of coherent, two-way range-rate data. A 
slight problem complicating exact comparison with the theory is that the computer programs employed assumed that 
when the spacecraft was not visible to the Earth tracking station, there would be no data available. This eliminated 
about 36 minutes of data (depending on the nominal inclination relative to the plane-of-the-sky) out of a total of 98 
minutes available. The theory, however, assumes that a full orbit of data is available no matter what. As a result the 
displayed errors are not in perfect agreement. The trends as functions of the angle 6 and the inclination i, however, are 
in excellent agreement and as seen in Figs. 2 and 3, clearly demonstrate the singular behavior of the circular orbit. The 
best place to compare theory and computer results is in Fig. 3 where the value of the inclination relative to the plane- 
of-the-sky is small. In this configuration, a full orbit of data would be available to both theory and computer results, 
yielding agreement to about 30 percent. As the inclination increases, the agreement between theory and computer 
results degrades as more data is occulted. The error parameter displayed in these figures is 


\<r x \ - r<Te - a<?9 


(124) 


B. Elliptic Case 

The orbit determination studies for the general elliptic case are yet to be published. 1 


1 For additional information on the initial studies which helped to delineate the circular orbit problems, see the following: J. Ellis 
and R. K. Russell, “Earth-Based Determination of a Near-Circular Orbit About a Distant Planet,” JPL Technical Memorandum 
391-406 (internal document), March 9, 1973; R. K. Russell, “Improved Orbit Determination of Low-Altitude, Nearly Circular 
Planetary Orbiters,” Engineering Memorandum 391-561 (internal document), April 25, 1974; R. K. Russell, “Analytic Orbit 
Determination, A General Methodology and a Successful Example,” JPL Engineering Memorandum 314-59 (internal document), 
August 9, 1976; and L. J. Wood “Orbit Determination Singularities in the Doppler Tracking of a Planetary Orbiter,” Journal of 
Guidance, Control, and Dynamics , volume 9, number 4, p. 485, July-August 1986. 
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VII. Conclusions 


It has been seen that circular orbits about distant planets may suffer singularities in over-all position error estima- 
tion. These singularities are due to orbit inclination, placement of the line-of-nodes, and insignificant cross-velocity at 
the start and end of retrograde motion when orbiting a superior planet. 

Even though these conclusions appear to yield poor state estimation, one should not be unduly alarmed inasmuch as 
the stated conditions for singularity are not maintained for extended periods during typical mission scenarios. However, 
mission analysts should be aware of these potential pitfalls and realize that spuriously large results for circular orbiters 
can be obtained and are not the result of incorrect assumptions or faulty software. 

The general elliptic problem appears so involved that analytic inversion at this time is just not feasible, and in any 
case the resulting expression for the position error would likely be so lengthy that any understanding would be lost in 
the maze. 
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z (TOWARD EARTH AT t = 0} 



Fig. 1. Spacecraft orbital geometry in the plane 
of the sky. 



Fig. 3. Norm of position error versus inclination, i, 
to the plane of the sky (with 0 ~ 0 deg). 




Appendix A 
Required Integrals I 

To perform the accumulation of the information array, a large number of difficult integrals of mixed trigonometric 
and algebraic functions and their powers are required. Contained herein are the necessary results. Note that due to the 
limits of integration, all integrals of odd functions will vanish. 

An important, general integral, from which some of the results contained herein are derived, is 




dE 




+ b cos E) n+1 ( a 2 _ 52)^ 


rPn 



(A-l) 


where P n are the Legendre polynomials. 


This integral is found in [1; p. 383, 3.661, 4]. 
For simplicity and convenience, we shall define: 


P — \J 1 — e 2 

The integrals necessary to perform the accumulation are: 


J (1 — e cos E)dE — 2 tt 
/> — e cos E) cos EdE = — 
J (1 — e cos E) cos 2 EdE — ? 

b°= 

L 

J cos 2 EdE = it 
J cos 3 EdE = 0 
J E sin EdE = 27 r 
J E sin E cos EdE = — j 


27T 

cos EdE = 0 


(A-2) 

(A-3) 


(A-4) 
(A-5) 
(A-6) 
(A-7) 
(A-8) 
(A-9) 
(A-10) 
(A-l 1) 
(A-12) 
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(A-13) 



dE / 2jt\ 

(1 — ecos E) \ /? ) 

cosEdE 

(1 — ecosE) \e/3j { P) \0 ) 
cos 2 EdE 


(1 — e cos E) 

cos 3 EdE 
(1 — ecos E) 

cos 4 EdE 
(1 — e cos E) 

E sin EdE 


=(*)< i -» , < 2+ »=(^) 

_/2»v r i+/? i 

(1 — ecos jE 7) \ e / n 1 2(1 + e) / 
EsinEcosEdE /2tt\ [" 1 ( 1+/? 11 

(1 — ecos £7) " V e 7 L * \2(l + e)/J 


T 2 (2 + /9) 


E'sinJE'cos 2 EdE 
(1 — e cos£) 


-&)[- 




i+/? 
2(1 + e) 


}]] 


(A-14) 
(A- 15) 
(A-16) 
(A-17) 
(A- 18) 
(A-19) 
(A-20) 


For a full development of the following integral, see Appendix B. 


r E 2 dE 

L t (1 — e cosl?) 



(A-21) 


where 


Continuing, 


5(7) = 


3 n 2 



E 2 cos 2 EdE f 2w\ [ ( 

(1— ,co»E) = (^jl 5(l,)+<; ( 2e -T)j 

dE _ /2i\ 

(1 — ecos E) 2 \/? 3 / 

cos EdE _ / 2ire \ 

(1 — ecosi?) 2 \ (3 3 ) 




(A-22) 

(A-23) 

(A-24) 

(A-25) 

(A-26) 
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£ 

£ 

£ 

£ 

£ 

£ 

£ 

£ 

£ 

£ 

£ 

£ 

£ 

£ 

£ 

£ 

£ 


= (£)*»+*« 


cos 3 

(1 — ecos l?) 2 

(!*£!£)» = (^ 5 ) (1 - « !(2 + v ^ - ^ = fe ) " ! < 2 + ^ - 2<J! - 

EsinEdE /2tt\ T 1 1 1 /2x\ , 

= (tJ b-i^J = UJ< 1 - T > 

1 + /? 


(1-ecos#) 2 

E sin E cos EdE 
(1 - ecos £) 2 

E sin E cos 2 EdE 


f?*\ [1 1 . f 1 + /? V 

~Uvb 1 + e + n \ 2(1 + e) J 


'2*\ n 

e 3 


1 


(l-ecosi?) 2 V e3 7L/? 1 + e 6 + { 2(1 + e ) } 


E 2 dE 


(1 — e cos E) 2 

dE 

(1 — e cos E) 3 

cos EdE 
(1 — e cos E) 3 

cos 2 EdE 


= (^) [s( 7 )- 4 / 31 n(l + 1 ) 




- ( Z7re \ 
~ U 5 ) 


(1-ecosE) 3 ~ U 5 J (3 ^ 


COS' 




(1 T e "g)» ■ ^ J (* ■ - + *» + 2 ^ - 2 « = (£) + + 2 ^ - 2 « 

■ (&) + * - 2 ^ ■ (&) + 2<J - *”> 

EsinEdE /ir\ T 1 1 

“ W [ P 3 ~ (1 + e) 2 . 

(1 + 2e)1 


(1 — e cos E ) 3 
i^siniJcos EdE 


(1 — ecos l?) 3 (e 2 ) [/? 3 ^ 2/? ^ + (l + e) 2 J 

Esin Ecos 2 EdE _ / * \ M_ _ 2) (3 + 4e) _ f 

(1 — e cos E) 3 ~ L 3 / [/? a( ** + (1 + e) 2 \ 

E 2 dE 


1 + P 


;}] 


(1 + e) 2 '“‘l2(l + e) 

(1 - e cos £)* = (?) [< 3 “ ^ - W1 " (1 + T) - 2e ' J<1 - T > 

E 2 cos EdE _ ^JLj | 3( 1 - - 4,3(3 - 2,3=) ln( 1 + - 2c/J( 1 - 

(l-«cos£)4 = (??) [(3-5^ + 2rtS(r)-4«3-4?)1n(l + 7)-2 e «l- T ) 


(1 — e cos i?) 3 

E 2 cos 2 EdE 


(A-27) 

(A-28) 

(A-29) 

(A-30) 

(A-31) 

(A-32) 

(A-33) 

(A-34) 

(A-35) 

(A-36) 

(A-37) 

(A-38) 

(A-39) 

(A-40) 

(A-41) 

(A-42) 

(A-43) 
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(A-44) 


h (5-4/? 2 ) 


r cos 3 EdE 


cos 4 EdE 


= (w>- 


(1 - /?) 2 ( 5 + 10/? - 10/? 3 - 4/? 4 + 2/? 5 ) 
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(1 4- 3e) 
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vl2e 3 / 

E 2 dE 

fM[ 3 

(1 - ecos E)* 

Vi2/?v l 3 

— 2e/?( 1 - 

E 2 cos EdE 
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(1 — e cos E ) 5 
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Appendix B 
Required Integrals II 

Out of all these difficult integrals, only one class is not expressible in closed form. These are integrals of the form 


E 2 cos m EdE 
_ x (1 — ecos E) n 


(B-l) 


We shall confine our interest to the lowest order of the above, namely where n = 1 and m = 0. The other integrals can 
be simply obtained from this one, which is 


-L-t 


E]dE_ 

ecosE 


(B-2) 


If the square function is expanded in a Fourier series over the interval — 7r to tt, one obtains 


E 2 = - 4 [cos E - cos 2E + ^ cos 3E j , —w < E < 7r 


(B-3) 


This now can be made to yield the integral we are seeking: 



E 2 dE 
1 — e cos E 


r — d JL — +4 f c ° sn£ ) dE 

3 y_ T l-ecos E ” 2 V 1 - e cos EJ 



cos nEdE 
1 — e cos E 


Thus, 



cos nEdE 
1 — e cos E 



cos nEdE 
1 — e cos E 


This latter integral can be found in [1; p. 366, 3.613] as 
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P V 

e ) 


(B-4) 

(B-5) 


(B-6) 


(B-7) 


24 


' y 

TDA Progress Report 42-98 


'IbSfc p$i 

N90-12788 

August 15, 1989 


A Statistical Study of Radio-Source Structure 
EfFects on Astrometric Very Long Baseline 
Interferometry Observations 

J. S. Ulvestad 

Tracking Systems and Applications Section 


Errors from a number of sources in astrometric Very Long Baseline Interfer- 
ometry (VLBI) have been reduced in recent years through a variety of methods 
of calibration and modeling. Such reductions have led to a situation in which the 
extended structure of the natural radio sources used in VLBI is a significant error 
source in the effort to improve the accuracy of the radio reference frame. In the 
past j work has been done on individual radio sources to establish the magnitude of 
the errors caused by their particular structures . This article reports the resuits of 
calculations on 26 radio sources in which an effort is made to determine the typ- 
ical delay and delay-rate errors for a number of sources having different types of 
structure. 

It is found here that for single observations of the types of radio sources present 
in astrometric catalogs , group-delay and phase-delay scatter in the 50-100 psec 
range due to source structure can be expected at 8.4 GHz on the intercontinental 
baselines available in the DSN. Delay-rate scatter of ~ 5 x 10~ 15 sec sec~ 1 (or 
~ 0.002 mm sec ~ 1 ) is also expected. If such errors mapped directly into source 
position errors, they would correspond to position uncertainties of ~ 2-5 nrad 
(0.5-1 milliarcsec, or 0. 1-0.3 microdeg), similar to the best position determina- 
tions in the current JPL VLBI catalog. With the advent of wider bandwidth VLBI 
systems on the large DSN antennas, the system noise will be low enough so that 
the structure-induced errors will be a significant part of the error budget. Several 
possibilities for reducing the structure errors are discussed briefly, although it is 
likely that considerable effort will have to be devoted to the structure problem in 
order to reduce the typical error by a factor of two or more. 


I. Introduction 

Very Long Baseline Interferometry (VLBI) is a radio 
astronomical technique which can provide exceptionally 


high resolution (~ 5 nrad) for observations of compact nat- 
ural radio sources. It involves simultaneous observations 
of a radio source by two or more observatories that may be 
separated by a large fraction of the Earth’s diameter, or 
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Thus, finally 


where 



cos nEdE 
1 — e cos E 



(B-8) 


s( 7) = 

T 2 A 4^(-i) n „1 

U + 4 V„= 

n = 1 

(B-9) 

/? = \/ 1 - e 2 

(B-10) 

7= 1 

(¥) 

(B-ll) 


Reference 

[1] I. S. Gradshteyn and I. M. Ryzhik, Tables of Integrals, Series and Products, 
Corrected and Enlarged Edition, New York: Academic Press, Inc., 1980. 
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by more than an Earth diameter if one of the observatories 
is in space. The uses of VLBI currently fall into two major 
categories. The astrophysical uses involve learning about 
the objects studied because of interest in their physics, the 
structures, changes, and motions of compact components 
in natural radio sources. The other major use for VLBI is 
astrometric and geodetic in nature; it involves the study 
of sources not for their own sake, but for their use as ref- 
erence beacons in studies of Earth orientation and plate 
tectonics as well as spacecraft navigation. The two cat- 
egories of VLBI investigations are complementary in the 
sense that improved knowledge of the properties of natural 
radio sources can be used to better understand and utilize 
their performance as celestial beacons. 

The natural radio sources used as beacons are very 
distant, extragalactic objects. For the purposes of this 
article, they will often be referred to by the generic name 
of quasars , although they may also include distant radio 
galaxies, BL Lacertae objects, and some sources whose 
specific optical counterparts have not yet been identified. 
The quasars are so distant that they can be used to define 
an inertial reference frame. Spacecraft motions within the 
solar system and motions on the surface of the Earth can 
be found with respect to that reference frame. 

Most astrometric VLBI work has proceeded under the 
assumption that the quasars are point radio sources with 
positions that are constant in time and do not shift as ob- 
servation parameters change. Although it has long been 
known that this is only an approximation, it has been an 
appropriate assumption as long as the errors it introduces 
have been significantly smaller than the errors introduced 
by other aspects of the VLBI observations. Some other 
error sources in astrometric VLBI are imperfect knowl- 
edge of Earth orientation parameters, station location un- 
certainties, the static and fluctuating troposphere, VLBI 
instrumentation, and tidal effects. More detailed analy- 
ses or expected levels of these and other errors can be 
found in [1-3] and will not be repeated here. VLBI has 
now reached the stage where the above errors can be cali- 
brated or modeled well enough so that the residual errors 
on interferometric delay measurements on intercontinental 
baselines are in the <100 psec range. 

With the use of VLBI systems that enable high sam- 
pling rates and large spanned bandwidths (e.g., the Mark 
III system, [4]), expected delay errors using DSN antennas 
are considerably less than 100 psec. On intercontinental 


baselines, errors of approximately 150 psec of delay corre- 
spond to position uncertainties of about 1 milliarcsec [1]. 
However, many of the quasars used for astrometric and 
geodetic VLBI show structure on scales of several milliarc- 
seconds. This structure is known to change with time, and 
its effects on VLBI observables also depend on the base- 
line length and orientation in a given observation. Thus, it 
has been thought that delay errors caused by source struc- 
ture are at roughly the 100 psec level on intercontinental 
baselines. This is particularly true in the case of VLBI 
catalogs that are used for spacecraft navigation. Because 
of the need for a large number of radio sources distributed 
around the sky with a high density of sources in the eclip- 
tic plane, it has not been possible to be selective enough 
to eliminate all the radio sources in which structure may 
have an effect of ~ 100 psec. 

The purpose of the work described in this article is 
to use the intensity distributions of real radio sources to 
estimate the magnitude of the error that is made by ob- 
serving these resolved radio sources. Maps of the structure 
of a number of compact radio sources are used to study 
their effects on source position estimates. Rough error es- 
timates for several different quantities are made and their 
dependence on several parameters is also studied. 

II. Important Observed Quantities in 
Astrometric VLBI 

Observed quantities in astrometric VLBI include 
group delay (or bandwidth synthesis delay), phase delay, 
delay rate, and correlated amplitude [1]. The delay is de- 
fined as the difference between the arrival times of a radio 
signal at two stations. In the case of quasars, the radio sig- 
nal is a noise-like, broad-band signal rather than a narrow- 
band tone or set of tones as is the case for a spacecraft. 
The component of delay that is sought for understanding 
quasar and observing-station locations is the geometric de- 
lay , which is the delay caused by the combination of the 
geometry of the baseline and the direction to the radio 
source. Various effects mentioned in Section I can cause 
errors in the determination of the geometric delay. 

Most of the important observed quantities have been 
defined and explained in [5], and the reader is referred to 
that article for more detail than given below. Tradition- 
ally, the group delay has been used and is defined as 
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d<t> 

T 3- du, 


(5) 


( 1 ) 



Here, <j> is the relative phase of the radiation between the 
two stations and w is the angular frequency (u; = 2nv, 
where v is the sky frequency in units of Hz). Although the 
group delay is formally defined as a derivative, it is actually 
determined as a differential over a small frequency range: 



( 2 ) 


A second quantity, not traditionally used in astro- 
metric VLBI, is the phase delay , which is defined by the 
formula 



(3) 


Phase delay is generally considered to have the potential 
for increased accuracy relative to group delay because of 
the “leverage” that is provided by having the total sky 
frequency in the denominator. However, phase delay tra- 
ditionally has not been used because of the difficulty of 
making the correct a priori resolution of cycle ambiguities 
[5], 


The delay rate is a third important astrometric quan- 
tity; it is currently used along with group delay in most 
astrometric VLBI. This quantity is defined as the time rate 
of change of the phase delay: 


. _ dr $ _ 1 d<t> 

T +~!)t~ udt 


(4) 


The delay rate will be used for VLBI tracking of the Mag- 
ellan spacecraft. 

A fourth quantity that is often ignored in astrometric 
VLBI will be introduced here, because it can provide im- 
portant clues to the appearance of errors caused by radio- 
source structure. That quantity is the correlated amplitude 
of the radio source signal. The correlated amplitude (or 
visibility amplitude) is defined and discussed extensively 
in many works such as [1]. It may be expressed in units 
of flux density, as a correlation coefficient, or as an ampli- 
tude relative to the maximum possible. In this article, the 
visibility amplitude will be used, and is defined by 


Here, S c is the correlated amplitude (or correlated flux) 
and S t is the total amplitude (or total flux). 

For a point radio source, the visibility amplitude has 
a maximum value of one. When radio emission comes from 
an angular area whose dimensions are comparable to A /D, 
where A is the observing wavelength and D is the baseline 
length, the radio source is said to be “resolved.” That 
is, it shows interference properties different from those ex- 
hibited by a point radio source. The emission from differ- 
ent directions causes a “washing out” of the interference 
pattern, because the cross- correlations of radiation from 
different parts of the source interfere destructively for the 
two observing stations. Then the visibility amplitude (or 
fringe visibility) is reduced. Since the amount of resolution 
changes as Earth rotation alters the baseline orientation, 
the visibility amplitude for a specific radio source varies 
with time on a given baseline. This amplitude is an im- 
portant indicator of the magnitude of the source-structure 
effect on measured delays and delay rates, as will be de- 
scribed further below. 


III. Review of Source-Structure Effects 
on Astrometric VLBI Observations 

Extensive analysis has been done to determine how 
the non-pointlike structure of radio sources affects the 
quantities observed in astrometric VLBI [6, 7], and will 
not be repeated here. Typically, the structure-induced 
group-delay error is largest for a given quasar when the 
visibility amplitude is lowest, or when the interferometer 
baseline “heavily resolves” the radio source. 

In order to examine trends in the structure-induced 
delays, it is useful to introduce the concepts of the (u,v) 
plane and the inteferometer hour angle (IHA). (Again, see 
[I].) For a single telescope, the hour angle is the east- west 
angle of the source relative to the local meridian. When 
a source transits the local meridian moving from east to 
west, its hour angle is said to be zero. One hour earlier, 
the hour angle was —15 deg, or —1 hour; one hour after 
transit, the hour angle is -f 15 deg, or +1 hour. The IHA 
for two separated telescopes is defined analogously, with 
the “local” meridian being defined as the meridian at the 
midpoint in longitude between the two observing stations. 
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The (u, v) plane is the plane perpendicular to the di- 
rection from the Earth’s center to the radio source, with 
u increasing to the east and v increasing to the north. 
As the Earth rotates, the projected interferometer base- 
line in the (u,i>) plane changes, which changes the inter- 
ferometer response to any extended structure in the radio 
source. The evolution of the (u, v) coordinates of the base- 
line is calculable solely from the geometry of the locations 
of the observing stations and the celestial coordinates of 
the radio source. These coordinates are normally mea- 
sured in units of wavelengths of the observing frequency, 
so that 1 /y/u 2 4* v 2 gives the number of wavelengths in an 
interference fringe, which is the approximate spatial res- 
olution (in radians) of the interferometer. Figures 1 and 
2 show plots of the (u, v) points sampled for two different 
sources, one on the California-Spain baseline and one on 
the California-Australia baseline. 

The structure phase delay tends to be constant when 
the structure group delay is small, in regions of the (it, u) 
plane where the source visibility amplitude is high. How- 
ever, this phase delay is not necessarily the same as it is 
for other projections of the same baseline that also have 
high visibility. If the (u,t>) track for a given source and 
station pair passes through a low-visibility region between 
the two high-visibility regions, the phase delay will change 
substantially in the low visibility region. This means that 
the structure-induced delay rate should also be large in 
regions of low visibility. Examples of such behavior will 
be discussed in Section V. 


IV. Calculations of Source-Structure 
Errors for Real Radio Sources 

The investigation of source structure using VLBI has 
made substantial progress in the last 10 years, as improved 
calibration, mapping, and modeling procedures have been 
developed. It has become possible for astrophysicists to 
map large numbers of compact radio sources for a variety 
of purposes. One such investigation that was begun more 
than 10 years ago is a 5-GHz mapping survey of strong 
extragalactic sources that has been carried out by the Cal- 
tech VLBI group [8, 9]. This survey is intended to make 
maps of approximately 45 sources at several epochs to 
study their overall properties and their structural changes 
with time. First-epoch models of 26 sources were kindly 
made available by T. Pearson to enable the author to study 


the statistics of the structure effects on measured group de- 
lay and phase delay. VLBI maps of many of these objects 
are shown in [9]. 

The available models of compact VLBI sources are 
derived by measuring the complex source visibilities at 
a large number of (u,r) points, Fourier-transforming the 
data to make maps of total intensity versus position in 
the sky, then deconvolving these maps into superpositions 
of point sources. The fluxes and positions of a set of 
such point sources form a model of a single complex ra- 
dio source. More detailed information on mapping and 
modeling procedures is available in [10]. 

For each of the 26 sources studied in this work, the 
(ti,f) tracks on the Deep Space Network baselines between 
California (Goldstone) and Spain (Madrid) and/or Califor- 
nia and Australia (Tidbinbilla) were computed. (Since the 
survey sources are fairly far north, they are all visible on 
the baseline to Spain, but many are not visible on the base- 
line to Australia.) The (u,v) tracks were sampled every 3 
min, which is roughly the duration of a typical astromet- 
ric VLBI observation of an individual source. The source 
models were then used to compute the group delay, phase 
delay, and delay rate caused by structure at each sampled 
point, using the N-point-source formalism described in [6]. 
All quantities were computed relative to the flux-density 
centroid (“center of gravity” of the source flux) of each 
individual source, and computations were made only at 
8.4 GHz. 

Most astrometric VLBI makes use of observations at 
both 2.3 GHz and 8.4 GHz, but corrections for charged- 
particle propagation lead to the 8.4-GHz data having ap- 
proximately 13 times greater weight than the 2.3-GHz 
data. Flux-density centroids at the two frequencies may 
differ by as much as 5 nrad, leading to a possible error 
of up to about 10 psec in the charged-particle correction 
to the delays. The interference phenomena will also dif- 
fer significantly, with larger source sizes at 2.3 GHz being 
partly offset by the poorer spatial resolution at the lower 
frequency (see below). Again, this may be absorbed in the 
charged-particle correction, and will probably cause an er- 
ror of no more than about 30 psec of delay in the worst 
cases. As seen from the results presented in Section V, 
this means that most of the effect of source structure can 
be found by doing studies at 8.4 GHz alone. 

A number of additional assumptions were made in 
computing the effects caused by structure. First, it was 
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assumed that the source structures at 8.4 GHz were the 
same as those at 5 GHz. It is well known that this is 
not true, since there are substantial variations in spectral 
index in different parts of VLBI sources. Typically, there 
is one radio component that is identified as the “core” of 
the quasar by virtue of the fact that its spectrum is “flat” 
or “inverted” (i.e., the component strength at 8.4 GHz 
is greater than or equal to its strength at 5 GHz). The 
rest of the VLBI source typically shows a “steep” (flux 
falls off with increasing frequency) spectrum. Based on 
5-GHz maps alone, it is not always possible to identify the 
core component. Most sources also do not have detailed 
spectral information available, so it is not valid to attempt 
to correct for the source spectrum in making calculations 
at 8.4 GHz. 

Another important approximation arises in comput- 
ing source structure effects along (u,v) tracks that were 
not actually sampled in the VLBI observations used in 
making the original source maps. Use of the models de- 
rived from baselines that are generally shorter than the 
DSN intercontinental baselines amounts to making an ex- 
trapolation in the (u, v) plane to determine the source vis- 
ibility. Since a baseline between Owens Valley, California 
and Bonn, Germany was used in making most of the maps, 
the extrapolation is not extremely large in making calcula- 
tions on the Goldstone-Madrid baseline. However, a sub- 
stantial extrapolation is involved in making calculations 
on the Goldstone-Tidbinbilla baseline. 

Finally, there is an approximation in the use of the 
derivative rather than the differential in computation of 
the group delay. For bandwidth synthesis observations 
using the Mark II VLBI system and the DSN, the maxi- 
mum spanned bandwidth is typically 40 MHz. This is less 
than 0.5 percent of the 8.4-GHz observing frequency, so ap- 
proximating the differential as a derivative is fairly good. 
However, the spanned bandwidth of ~ 400 MHz typically 
used in the Mark III VLBI system approaches 5 percent 
of the observing frequency. Then, using the derivative at 
a single point is a less accurate approximation. Although 
the group delay may be very large at a single frequency, it 
can be significantly smaller at a frequency 300 MHz away. 
Then, the use of the derivative instead of the differential 
would tend to overestimate the effect due to structure. For 
phase delay, the difference between the smaller and larger 
spanned band widths is unimportant. 

The data do not exist to enable us to make a quan- 
titative study of the errors introduced by the above ap- 


proximations. Qualitatively, the use of 5-GHz maps for 
calculations made at 8.4 GHz will probably lead to an 
overestimate of the group delay caused by structure. Since 
sources tend to be more dominated by a single component 
at 8.4 GHz than they are at 5 GHz, they show less of an 
effect due to the steep-spectrum extended emission. The 
neglect of the effects at 2.3 GHz would cause an underesti- 
mate of the structure effects, but the low weighting of this 
frequency in the charged-particle correction means that 
this underestimate is of relatively minor importance. The 
extrapolation in the (u,i>) plane means that some compo- 
nents that might be resolved on long baselines at 8.4 GHz 
are not separated in the models used, leading to a possi- 
ble underestimate of the effects due to structure. Finally, 
as mentioned already, the approximation of a differential 
as a derivative is likely to lead to an overestimate of the 
group-delay error in some cases. 

It is clear that the results of the calculations presented 
below can be used only as statistical indicators of the mag- 
nitude of the source-structure errors and cannot be used 
to make detailed statements about specific cases. Overall, 
the magnitude of the estimated errors is probably within 
a factor of 1.5-2 of the “real” errors induced by structure. 
This error estimate is derived by procedures such as using 
known spectra of parts of sources that have been mapped 
at several frequencies to convert from their structures at 
5 GHz to those estimated structures at 8.4 GHz, and then 
recomputing the structure-induced delay errors. 

V. Statistical Results 

A. Some General Considerations 

A variety of statistical quantities can be computed 
from the source structure effects found for the individual 
sources in the sample. The relevance of each quantity de- 
pends on the specific circumstances under which a source 
is observed and the way the VLBI data are analyzed. For 
example, if a single observation is made of a source for 
which there is no a priori information about structure, 
the scatter about zero structure effect (“root-mean-square 
about zero”) will give the typical deviation of quantities 
from those measured at a nominal position given by the 
source centroid. But if a large number of measurements 
of a source have been made at a variety of (u, v) points, 
the a priori source position may be offset from its flux 
centroid by an amount governed by the average structure 
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effects in the previous observations. Then, the tradition- 
ally computed root-mean-square would give the scatter on 
top of an average bias that has been built into the a pri- 
ori values. Of course, since structure effects may be ab- 
sorbed by other parameters in the multi-parameter fit that 
gives source positions, the a priori position will not be ex- 
actly that given by the centroid plus the mean structure 
effect. Further, the offset from the centroid would depend 
on which astrometric quantities were used in deriving the 
a priori position. 

A further consideration arises from the fact that the 
largest group-delay and delay-rate effects occur at points 
with low visibility amplitudes. However, some of these 
points will not exist in the final data set, because the low 
amplitudes may mean that interference fringes are not de- 
tected, and no valid measurement is acquired. For ex- 
ample, a source with a total flux density of 0.8 Jy and a 
visibility amplitude of 0.1 is probably too weak to show 
fringes in a Mark II VLBI experiment using a 34-m and 
a 70-m antenna. If weak fringes are detected, the low- 
amplitude points will be lightly weighted because of their 
low signal-to-noise ratio. The overall reduced importance 
of the low-visibility points, where the largest offsets from 
the flux centroid would appear, makes it likely that the 
a priori position of a source will be very close to its flux 
centroid. Therefore, the scatter about zero structure ef- 
fect should be the most meaningful quantity in estimat- 
ing expected errors. Although scatters about both zero 
and about the mean have been computed for the source- 
structure effects, the scatter about zero will be the quan- 
tity quoted throughout the remainder of this section. The 
reader should keep in mind the fact that the structure er- 
rors are not Gaussian-distributed, so the scatters quoted 
below should not be thought of as the parameters of a 
standard error distribution. 

B. General Results for Deep Space Network 
Baselines 

Table 1 lists the 26 sources that were used in making 
calculations of the errors caused by source structure. This 
table includes a classification of the type of structure each 
source contains, roughly corresponding to those given and 
described in [8, 9]. Note that these 26 sources fall into 
5 different morphological classes; those divisions will be 
used below in determining which types of sources cause 
the greatest errors. 


Figures 3-18 give samples of the computed structure 
effects along the (tx, t;) tracks on DSN baselines for several 
different sources. Each figure shows the visibility ampli- 
tude and structure-induced delays and rates for a source, 
as functions of IHA. Note, as expected, that the largest 
group-delay and delay-rate errors occur near visibility min- 
ima, while the phase delay changes rapidly at these points 
and thus is different on opposite sides of the visibility min- 
ima. As mentioned previously, recall that all quantities are 
computed relative to the flux centroid of each source. 

Figures 3-6 show results for a double radio source on 
the baseline between California and Australia. This source 
is visible on the given baseline for approximately 4 hours. 
There are two regions where the visibility amplitude is 
below 0.05; the structure-induced group-delay and delay- 
rate errors at this point are impressively large. The phase- 
delay error has a large derivative at these points, but a 
large phase-delay error due to structure is not accumulated 
over the 4 hours of tracking. 

Figures 7-10 show results for a circumpolar, double 
source on the baseline between California and Spain. The 
separation of the main components in this source is more 
than 50 nrad, which is many times larger than the resolu- 
tion of the baseline. There are a large number of minima 
in the plot of visibility amplitude, and each corresponds to 
a spike in group delay and in delay rate. The phase delay 
changes substantially at each minimum and accumulates 
at the rate of approximately 100 psec per hour of observ- 
ing, with a maximum value of over 700 psec caused by the 
source structure. 

Figures 11-14 show results for a circumpolar asym- 
metric source that is fairly well dominated by its unre- 
solved core. The visibility amplitude never drops below 
about 0.4. The corresponding group-delay and phase- 
delay errors are always below 100 psec, in contrast to 
the results shown for the relatively large double source 
in Figs. 7-10. 

Figures 15-18 give results for another asymmetric 
source on the baseline between California and Australia. 
The only points below 0.1 visibility amplitude occur be- 
tween 3.2 and 3.3 hours IHA, with other points below 0.2 
occurring about an hour later. The group-delay and delay- 
rate errors are maximum at these points. But, it should be 
noted that the dependence of the group delay on the vis- 
ibility amplitude is dramatic at these local visibility min- 
ima. For a 33 percent decrease in visibility amplitude, 
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the structure-induced group delay grows by a factor of al- 
most 3. The delay-rate error shows a much more modest 
increase as the visibility amplitude goes down. 

Table 2 shows the structure-induced scatter (about 
zero) in astrometric VLBI quantities on the Califomia- 
Spain baseline for all 26 sources as a function of visibility 
amplitude. Table 3 shows similar results for the 12 sources 
visible on the California-Australia baseline. (Note that 
there are many fewer (u, v) points used on the latter base- 
line, so the conclusions are based primarily on the results 
for the California-Spain baseline.) As expected (see [6]), 
there is a strong dependence on visibility amplitude, with 
the lowest amplitude points giving the greatest errors in 
group delay and delay rate. The group-delay results are 
roughly comparable between the two baselines, but the 
phase-delay and delay-rate errors are much higher on the 
baseline to Spain. This difference can be explained by the 
long (u, v) tracks sampled on this baseline for circumpolar 
sources. Returning to Figs. 1 and 2, the plots show that a 
much wider variety of (u, v ) points was sampled for the cir- 
cumpolar source on the Spain baseline than for the other 
source on the Australia baseline. Since a larger variety of 
baseline projections was sampled, the phase contributed 
by source structure can wind through a larger number of 
cycles, giving a greater deviation from zero phase. Such 
an effect can be seen by examination of Figs. 7-10. 

As a check on the effect caused by circumpo- 
lar sources, one can divide the Spain data according to 
the source declinations. There are 9 sources that are cir- 
cumpolar at both antennas, having declinations above 60 
deg, and 17 sources at lower declinations. Approximately 
4000 (u, v) points are sampled for each set of sources. Ta- 
ble 4 gives the scatters about zero for the circumpolar and 
“other” sources, and shows a striking result. Although the 
typical group-delay errors are actually somewhat larger for 
the more southerly sources, the phase-delay scatter is a fac- 
tor of about 4 higher for the circumpolar objects, and the 
delay-rate scatter is a factor of 2 higher. This implies that 
a catalog that is constructed using phase-delay measure- 
ments between northern hemisphere antennas may have 
much more structure-induced scatter in the phase delay if 
a significant number of circumpolar sources are observed 
at a wide range of hour angles. This scatter can be reduced 
if such sources are only observed over a narrow range of 
hour angles, so that the phase does not wind through a 
large number of cycles. However, since the observations 
at a wide range of hour angles can be useful in isolating 


the effects of parameters other than source structure, the 
costs and benefits of such a limitation are unclear. Another 
possibility would be to derive different source positions for 
different ranges of hour angle, although the individual po- 
sitions would be of lower accuracy because of the reduced 
number of data points included for each. 

Tables 5 and 6 show the effect of arbitrary data cut- 
offs based on source visibility amplitudes. Again, both 
California-Spain (Table 5) and California-Australia (Ta- 
ble 6) data are shown. These tables give the scatter for all 
those points above a specific value of the visibility am- 
plitude. It is seen that cutting out the lower-visibility 
points dramatically reduces the group-delay error caused 
by structure, significantly reduces the delay-rate error, but 
has much less effect on the phase delay. This confirms the 
expectations that group-delay and delay-rate errors are 
highest at the low- visibility points, whereas phase-delay 
errors are less correlated with visibility amplitude. 

Table 7 shows the scatters for the total set of 26 maps 
and for each class of sources. (“Good” and “bad” sources 
are defined below.) Only the California-Spain baseline 
is shown, as there are not enough objects visible on the 
California-Australia baseline to have a sufficient number 
of sources in each class. Here, it is important to note 
that there are certain classes of radio-source structures 
for which the errors caused by structure are substantially 
larger than for other classes of sources. In particular, the 
“compact-double” (here, “compact” typically means 20- 
100 nrad) and “steep-spectrum-core” sources show much 
larger structure errors than the other types of sources. 
Group-delay errors for the “unclassified” sources are large 
solely because of the inclusion of 3C 84, which is well 
known to have significant structure on a scale of about 
100 nrad. 

Most sources whose structure would cause delay errors 
of hundreds of psec are not included in VLBI catalogs [11, 
12] that are used for astrometry and geodesy. Those for 
which observations have been attempted either have not 
shown strong (or any) interference fringes on the intercon- 
tinental baselines because of insufficient compactness, or 
have given large residuals in analyses of VLBI data. In ei- 
ther case, such sources would not remain in catalogs. For 
example, the compact double source 2021 + 614 (OW 637) 
was observed in early JPL VLBI experiments; because of 
its structure, however, it is no longer a prime observing 
candidate. 


32 


The compact-double sources, steep-spectrum-core 
sources, and 3C 84 can be defined as “bad” sources from 
an astrometric standpoint. The basic definition of “bad” 
can be made solely from their radio morphologies, without 
any actual calculations of the structure effects. Eliminat- 
ing them from those listed in Table 1, we are left with a 
set of 16 sources that are probably more representative of 
those that have survived a sieving process for inclusion in 
astrometric catalogs. These are the sources defined to be 
“good” astrometric sources. Indeed, of these 16 sources, 8 
of them are in the latest published JPL VLBI catalog [12]. 
Analysis of the data for the 16 “good” sources alone gives 
scatter in the delays and rates that is a factor of 3-7 less 
than for the entire set of 26 sources, as shown at the bot- 
tom of Table 7. These numbers are probably much more 
representative of the structure errors that may be present 
in the JPL VLBI catalog. 

Of the 10 “bad” sources eliminated, only 3C 84 is in 
the latest JPL VLBI catalog. It is only present because, 
despite its very low visibility amplitude, the source flux is 
so high that the correlated flux is still detectable even at 
points of very low visibility. For observers on Earth, there 
are no other sources comparable to 3C 84 in strength and 
structure, so its properties cannot be used to generalize to 
other sources that might be in VLBI catalogs. 

Tables 8 through 12 show the same results as in Ta- 
bles 2 through 6, but with only the “good” sources in- 
cluded. The same trends with visibility and source dec- 
lination are indicated, although they are somewhat less 
marked than for all 26 objects because of the overall re- 
duction in the scatter for these sources. A comparison of 
Tables 2-3 and 8-9 shows that the good sources have many 
fewer low-visibility points, as would be expected. But even 
at the low-visibility points, the structure effects are much 
less severe than for the “bad” sources at points with simi- 
lar visibility amplitudes. This is because the “bad” sources 
are generally many resolution elements in size; as pointed 
out in [6], for points with the same visibility amplitude, 
larger sources give more pronounced structure effects than 
do smaller sources. 

Comparison of Tables 4 and 10 shows that the dif- 
ferences in phase delay and delay rate between circum- 
polar and more southerly sources is less pronounced for 
the “good” sources than for all sources. This is probably 
because the “good” sources are not as many resolution el- 
ements in size. Therefore, as the baseline rotates under 


the radio sources, the phase does not wind nearly as much 
for the smaller sources. Figures 11-14 show results for 
a “good” circumpolar source and illustrate the relatively 
small accumulation of phase-delay error in such a case. 

The results given in Table 5 for all sources and in Ta- 
ble 11 for “good” sources are quite different for the low 
visibility cutoffs, but converge to similar values in group 
delay and delay rate at the higher visibility cutoffs. This 
convergence comes about because there are few points of 
high visibility in the “bad” sources, so the data largely 
overlap. Despite the overlap, there is still a large difference 
in phase-delay results, even at visibilities of 0.6 and higher. 
This is due to the fact that the phase delay at a given (u, v) 
point depends on the phase-winding at all other points for 
the same source. Even for a high-visibility point in a “bad” 
source, the connected phase may wind through several cy- 
cles at visibility minima, so the residual phase delay at the 
high-visibility point can be several times a single X-band 
cycle ambiguity of 120 psec of delay. In Figs. 7-10, note 
that there are a number of (u, v) points with visibility am- 
plitudes near 0.6 where large phase-delay errors have ac- 
cumulated. It does not take many such points to raise the 
phase-delay scatter at moderate visibility amplitudes from 
the approximately 30-60 psec for “good” sources to the 
values of more than 100 psec found for the “bad” sources. 


C. Dependence of Structure Errors on Baseline 
Length 

Since astrometry and spacecraft navigation can be 
done on baselines shorter than the DSN intercontinen- 
tal baselines, it is of interest to estimate the delay errors 
caused by structure as a function of baseline length. For 
the 16 “good” sources, calculations have been made on the 
baseline between Haystack Observatory in Massachusetts 
and Fort Davis, Texas, as well as on the baseline between 
Haystack and Green Bank, West Virgina. The lengths of 
these baselines are 3140 km and 840 km, compared to the 
8390 km of the Goldstone-Madrid baseline. These shorter 
baselines were actually used in making almost all the maps 
in the sample. Of course, the observing frequency was 
5 GHz rather than 8.4 GHz, so the actual points 

used in the computations were not sampled. 

Figures 19-22 show the results for a single source on 
the baseline between Haystack and Fort Davis. Note that 
the visibility amplitude is considerably higher than for the 
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same source on the California-Australia baseline (Figs. 15- 
18). Figures 23-26 show a comparison for the two baselines 
with the same vertical scales for the errors. As expected, 
the typical group-delay, phase-delay, and delay-rate con- 
tributions due to the structure are considerably lower on 
the shorter baseline, as are the maximum excursions from 
zero. 


Table 13 shows a comparison among the three differ- 
ent baselines. (Preliminary results were presented previ- 
ously in [13].) The group-delay scatter caused by structure 
ranges from 5 psec on the shortest baseline to 83 psec on 
the longest baseline, with the corresponding phase-delay 
scatter being roughly comparable in size. Power-law fits 
of scatter versus baseline length give exponents of approx- 
imately 1.2 for the delays and rates over a factor of 10 in 
baseline length. However, the slope of the relation appears 
to steepen at the shorter baseline lengths; this is proba- 
bly because the “good” sources studied here are all poorly 
resolved by baselines with lengths of about 1000 km. Ta- 
ble 14 shows that the slope change for such baselines is 
much less pronounced when all 26 sources are considered, 
because some of these sources are large enough to remain 
well resolved even on the baseline between Haystack and 
Green Bank. It is not appropriate to use the available 
source maps to investigate effects at even shorter base- 
lines. Since shorter baselines were not used in acquiring 
the VLBI data that went into the models studied in this 
article, the observations were insensitive to the larger scale 
structure that is known to be present in many sources. 

The estimated scaling with baseline length is not al- 
ways appropriate for individual sources, but has a compli- 
cated dependence on the source structure and the inter- 
ferometer baselines. Scaling to shorter baselines certainly 
would not work for all objects. For instance, the compact- 
double sources are small enough that they would be mostly 
unresolved by baselines of on the order of 100 km, so the 
structure errors in group delay would probably be compa- 
rable to those of the sources included in Table 13. Since 
the errors are much larger on the intercontinental base- 
lines in the double sources, the scaling with baseline length 
would be different. This points out the fact that any time 
baseline lengths are changed by a large amount (say, a 
factor of about 2), different sources and classes of sources 
may be appropriate astrometric targets. More sources can 
be used on shorter baselines than on the intercontinental 
baselines of the DSN. Consideration of the properties of 
these additional sources would be necessary to determine 


the magnitude of structure errors that might be expected, 
for instance, on the 253-km baseline between Goldstone 
and Owens Valley, California. 


VI. Discussion and Conclusions 

A statistical investigation of the effects of compact- 
radio-source structure on astrometric measurements has 
been carried out using models of 26 different radio sources. 
The calculations show that for the entire sample of sources, 
with observations made on intercontinental baselines, the 
structure-induced scatters in group delay and in phase de- 
lay amount to hundreds of psec. These variations with in- 
terferometer hour angle correspond to several milliarcsec 
of position error. However, when sources that are unlikely 
to appear in astrometric catalogs are eliminated, the typi- 
cal delay errors caused by structure for individual observa- 
tions are reduced to between 50 and 100 psec, while delay- 
rate scatter is approximately 5 x 10“ 15 sec sec -1 . This 
is somewhat less than the residuals in typical interconti- 
nental Mark II VLBI experiments performed with DSN 
antennas, but is larger than the residuals for Mark III ex- 
periments. Such errors correspond to position errors of 
about 0.5-1 milliarcsec, comparable to the current accu- 
racy of the best positions in the JPL VLBI catalog. In our 
current VLBI data, source structure is sometimes obvious 
in the correlated amplitudes, but is not readily apparent 
in delay measurements. Signs of structure-induced delay 
residuals have been seen in astrometric Mark III VLBI 
data taken by others [14, 15] 1 and efforts are underway to 
identify such residuals in DSN data. 

It is interesting to note that quasar structure is unim- 
portant for the Magellan mission to Venus. The delay-rate 
scatter of 6.4 x 10" 15 sec sec -1 (or 0.002 mm sec -1 ) shown 
for the California-Spain baseline in Table 13 is 1-2 orders 
of magnitude lower than the expected error in Magellan 
AVLBI measurements. 2 The error for Magellan is domi- 
nated by the fluctuating troposphere, but there are other 
sources of error that are well above the expectations for 
source structure. 

1 Also see J. S. Ulvestad, “Possible source structure effects 
in IRIS data,” JPL Interoffice Memorandum 335.3-88-15 (in- 
ternal document), Jet Propulsion Laboratory, Pasadena, Cali- 
fornia, February 3, 1988. 

2 J. S. Border, “An Error Analysis for Magellan Differen- 
tial Delay Rate Measurements,” JPL Engineering Memoran- 
dum 335-96 (internal document), Jet Propulsion Laboratory, 
Pasadena, California, February 23, 1987. 
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The magnitude of structure-induced delay errors as a 
function of baseline length has also been investigated. The 
errors appear to increase slightly more than linearly with 
baseline length for the compact sources studied here. How- 
ever, it must be recognized that different sources have 
structure on a variety of scales. Therefore, the radio 
sources selected to have sufficient correlated amplitude to 
be seen on intercontinental baselines will tend to be poorly 
resolved on shorter baselines, leading to the reduced struc- 
ture effects. The dependence on baseline length may be 
different for sources that are not preselected to be highly 
compact. 

Several possible methods could be used to minimize 
source-structure errors in VLBI astrometry. The ideal ap- 
proach for construction of a catalog or for individual obser- 
vations of a spacecraft relative to a reference quasar would 
be to select only sources that show evidence of having com- 
pact structures that give very little error. As mentioned 
previously, the JPL VLBI catalog already consists almost 
entirely of sources that would be in the “good” category, 
having typical structure errors below 100 psec of delay. A 
further reduction in error could be achieved by construct- 
ing a VLBI catalog using the sensitive Mark III VLBI sys- 
tem, because there are many more detectable sources from 
which to choose those with minimal structure. However, 
such a catalog might not be of much use in spacecraft 
navigation using a narrow-bandwidth VLBI system such 
as the current 250-kHz-bandwidth Block I system. 3 This 
is because there are only a limited number of natural radio 
sources that can be detected with the narrow-bandwidth 
system. In any case, source structure is not much of an is- 
sue with a relatively insensitive system, both because the 
system-noise errors are higher than the source-structure 
errors and because the lowest amplitude points will be 
weeded out due to the lack of source detection in the cross- 
correlation. 

The newest JPL VLBI catalog contains only about 
20 sources with correlated fluxes on DSN intercontinen- 
tal baselines that are above 1 Jy at both 2.3 GHz and 
8.4 GHz. 4 For spacecraft navigation with the Block I sys- 

3 J. B. Thomas, “An Error Analysis for Galileo Angular 
Position Measurements with the Block I ADOR System,” JPL 
Engineering Memorandum 335-26 (internal document), Jet 
Propulsion Laboratory, Pasadena, California, November 1981. 

4 J. S. Ulvestad and O. J. Sovers, “Preliminary VLBI Cat- 
alog for Magellan,” JPL Interoffice Memorandum 335.3-89-14 
(internal document), Jet Propulsion Laboratory, Pasadena, 
California, January 30, 1989. 


tern, these sources are the only ones that can be detected 
on DSN baselines with two 34-m antennas. This is already 
too small a number of sources. Reduction of this sample 
size in order to minimize errors due to source structure 
would make the source density unacceptably small. Then, 
large errors could be caused by the reference quasars be- 
ing angularly far from the spacecraft being navigated. Of 
course, the problem of a low density of quasars could be 
reduced by using an increased bandwidth in the VLBI sys- 
tem used for navigation. 

Another method of reducing structure effects in build- 
ing a VLBI catalog is a selection based on visibility am- 
plitude. As expected, the group-delay and delay-rate er- 
rors are well correlated with visibility amplitude, while 
the phase-delay errors are less strongly related to visibil- 
ity. This confirms the supposition that the removal of 
low-visibility data can reduce source-structure effects in 
current astrometric catalogs, where group delay and delay 
rate are the observed quantities that are used. However, 
in experiments where phase connection is achieved and the 
potentially more accurate phase delay is used, such a sim- 
ple procedure will not reduce the structure effects greatly. 
Table 11 shows that a cutoff at a visibility amplitude of 
0.3 would reduce the group-delay and delay-rate scatters 
by about 30 percent on the Goldstone-Madrid baseline at 
the expense of about 14 percent of the data points going 
into a catalog. However, such a cutoff would have no ef- 
fect on the phase-delay scatter. It should be noted that 
any such selection of individual data points, or a selection 
of sources as described previously, would require careful 
amplitude calibration of the VLBI data. Such calibration 
generally is not carried out as a routine part of the gath- 
ering of astrometric VLBI data. It would require careful 
design of a system to monitor system temperatures both 
on and off source as well as additional steps in the data 
analysis process. 

Another possible method of reducing source-structure 
errors would be to model the sources being observed and 
use those models to compute corrections to the data. This 
is a difficult approach to implement, as it relies on hav- 
ing good models of the current structures of the radio 
sources at DSN observing frequencies. Such models can 
only be derived by making observations of each source at 
many different hour angles on many baselines, and then 
going through a time-consuming data-reduction process. 
The observations would need to be repeated on a regular 
basis at intervals of much less than a year. The antenna 
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time is not available for making such observations of many 
sources. Attempting to include simple model types in a 
parameter-estimation scheme may also be doomed to fail- 
ure, since the source-structure errors often depend on fine 
details of the radio morphologies that would be difficult to 
include in a parameter-estimation algorithm. 

Yet another method of reducing source-structure er- 
rors in individual VLBI measurements for spacecraft nav- 
igation would be to observe a number of sources in a local 
reference frame, as described in [16]. If all reference-source 
(and spacecraft) observations on different dates were made 
at the same hour angle, source structure would cause no 
change in the derived spacecraft position between those 
dates, as long as the source structures did not change be- 
tween observations. The assumption that structures do 
not change is probably good on time scales of a few months 
[17], but will break down on longer time scales. In some 
sources, the relative strengths of components will change 
more rapidly than their relative positions change, and this 
can greatly affect the magnitude of the structure effects. 

If the sources in a local reference frame could be 
thought of as unmoving beacons, spacecraft motion could 
be determined relative to those beacons. Of course, for 
absolute spacecraft position to be determined in a single 
observation, the positions of the reference sources would 
have to be determined relative to a VLBI catalog. If the 
observations used in constructing that catalog were made 
at a variety of hour angles, the scatter caused by source 
structure in the catalog observations would cause an error 
in the determination of the absolute spacecraft position. 


In summary, the source-structure errors may be an 
important effect in astrometric Mark III VLBI data taken 
with the DSN. With special effort, these errors could be re- 
duced somewhat. However, with currently practical meth- 
ods of reducing these errors, it is difficult to see how they 
could be made much smaller than about 30-50 psec of 
delay on individual observations of single sources. This 
would correspond to 1-2 nrad of position error (or 0.2- 
0.4 milliarcsec) on intercontinental baselines if the error 
mapped entirely into source position. Delay-rate scatter 
is likely to stay at least as large as 2 x 10 -15 sec sec -1 , 
giving a similar position error. The local-reference-frame 
method of determining spacecraft position has the poten- 
tial for reducing the scatter significantly if observations 
are scheduled carefully, although this method may not re- 
duce the error in tying such observations to a global VLBI 
catalog. 

As mentioned previously, the errors caused by source 
structure may not map solely into source-position errors, 
but may be absorbed by other parameters in a multi- 
parameter fit. For example, the long periods of apparent 
phase-delay drift shown in Fig. 9 might map into drift of 
the station clocks. An important next step for any analysis 
would be to understand how structure effects could affect 
a variety of parameters in a large fit. As part of that ef- 
fort, it would be important to isolate data that should be 
subject to large structure effects in the JPL VLBI data 
set; such points could then be eliminated in an effort to 
see how they have affected different parameters. 
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Table 1. Source maps used in statistical study 


IAU Name 

Other Name 

Structure Type [8, 9] 

Data Date 

0133 + 476 

OC 457 

Compact 

Dec. 1978 

0153 + 744 


Compact double 

April 1982 

0212 + 735 


Asymmetric 

Sept. 1980 

0316 + 413 

3C 84 

Unclassified 

Dec. 1978 

0454 + 844 

. . . 

Compact 

Aug. 1981 

0710 + 439 

OI 417 

Compact double 

Dec. 1982 

0711 + 356 

01 318 

Compact double 

Dec. 1982 

0804 + 499 

OJ 508 

Compact 

Dec. 1979 

0831 + 557 

4C 55.16 

Steep-spectrum core 

Dec. 1979 

0836 + 710 

4C 71.07 

Asymmetric 

Sept. 1980 

0850 + 581 

4C 58.17 

Unclassified 

July 1980 

0859 + 470 

4C 47.29 

Compact 

Dec. 1978 

0906 + 430 

3C 216 

Steep-spectrum core 

Dec. 1979 

0923 + 392 

4C 39.25 

Unclassified 

Dec. 1978 

1458 + 718 

3C 309.1 

Steep-spectrum core 

Dec. 1982 

1624 + 416 

4C 41.32 

Asymmetric 

July 1980 

1642 + 690 

4C 69.21 

Asymmetric 

July 1980 

1652 + 398 

4C 39.49 

Unclassified 

July 1980 

1807 + 698 

3C 371 

Asymmetric 

Dec. 1982 

1823 + 568 

4C 56.27 

Asymmetric 

Dec. 1979 

1828 + 487 

3C 380 

Steep-spectrum core 

Dec. 1978 

1928 + 738 

4C 73.18 

Asymmetric 

Sept. 1980 

2021 + 614 

OW 637 

Compact double 

Dec. 1982 

2200 + 420 

BL Lac 

Asymmetric 

Dec. 1978 

2351 + 456 

4C 45.51 

Asymmetric 

July 1980 

2352 + 495 

OZ 488 

Compact double 

Dec. 1979 


Table 2. Structure-induced scatter in VLBI observables on California— Spain 
baseline, as a function of visibility amplitude, for all 26 sources 


Visibility 


Scatter about centroid 


No. points 

Group delay, psec 

Phase delay, psec 

Delay rate, E-15 sec/sec 

© 

b 

1 

o 

2614 

271 


53.4 

285 

0.1 -0.2 

703 

478 


45.1 

891 

p 

io 

1 

p 

w 

419 

495 


33.4 

934 

0,3 -0.4 

195 

226 


19.8 

1610 

0.4 -0.5 

138 

216 


19.3 

1563 

0.5 -0.6 

78 

189 


5.1 

776 

b 

1 

p 

b 

50 

175 


3.8 

513 

°o 

b 

1 

t- 

b 

19 

143 


2.2 

514 

0.8 -0.9 

10 

137 


1.1 

333 

p 

b 

1 

b 

2 

26 


0.2 

772 


39 



Table 3. Structure-induced scatter in VLBI observables on California-Australia 
baseline, as a function of visibility amplitude, for the 12 visible sources 


Visibility 


Scatter about centroid 


No. points 

Group delay, psec 

Phase delay, psec 

Delay rate, E-15 sec/sec 

0 

b 

1 

o 

2273 

90 


68.9 

39 

0 

1 

o 

to 

600 

76 


24.7 

65 

CO 

b 

1 

b 

258 

63 


18.9 

80 

0.3 -0.4 

217 

63 


11.5 

121 

b 

1 

b 

101 

35 


4.3 

154 

0.5 -0.6 

45 

39 


2.5 

66 

0.6 - 0.7 

23 

24 


1.6 

85 

0 

1 

o 

bo 

24 

61 


1.2 

13 

0 
oo 

1 

© 

b 

9 

27 


0.8 

56 

0.9 - 1.0 

1 

26 


0.2 

134 


Table 4. Scatter of VLBI observables for circumpolar sources and other sources on 
California-Spain baseline, all 26 sources are included 


Sources 


Scatter about centroid 


Group delay, psec Phase delay, psec Delay rate, E-15 sec/sec 


No. points 


Circumpolar 465 385 30.3 4320 

Other 665 90 15.3 3871 


Table 5. Structure-induced scatter in VLBI observables on California-Spain baseline 
for all 26 sources, for points above a given visibility amplitude 


Scatter about centroid 


Visibility cutoff 

Group delay, psec 

Phase delay, psec 

Delay rate, E-15 sec/sec 

INo. points 

0.0 

569 

286 

24.4 

8191 

0.1 

298 

287 

22.8 

7906 

0.2 

193 

252 

18.1 

7015 

0.3 

126 

189 

14.3 

6081 

0.4 

90 

173 

11.7 

4471 

0.5 

46 

145 

3.2 

2908 

0.6 

26 

125 

2.2 

2132 

0.7 

12 

103 

1.4 

1619 

0.8 

6 

78 

0.7 

1105 

0.9 

2 

26 

0.2 

772 


40 


Table 6. Structure-induced scatter in VLBI observables on California-Austrafia baseline 
for the 12 visible sources, for points above a given visibility amplitude 


Visibility cutoff 


Scatter about centroid 

No. points 

Group delay, psec 

Phase delay, psec 

Delay rate, E-15 sec/sec 

0.0 

541 

50 

18.2 

813 

0.1 

216 

47 

10.6 

774 

0.2 

134 

43 

8.1 

709 

0.3 

109 

40 

5.5 

629 

0.4 

59 

32 

2.7 

508 

0.5 

23 

31 

1.4 

354 

0.6 

14 

29 

1.0 

288 

0.7 

8 

30 

0.5 

203 

0.8 

5 

27 

0.5 

190 

0.9 

1 

26 

0.2 

134 


Table 7. Structure-induced scatter in VLBI observables for different classes of source structure, 

on California-Spain baseline 


Class 


Scatter about centroid 


No. points 

No. sources 

Group delay, psec 

Phase delay, psec 

Delay rate, E-15 sec/sec 

All 

26 

569 

286 

24.4 

8191 

Steep-spectrum 

4 

833 

612 

49.3 

1259 

Compact double 

5 

469 

316 

30.9 

1535 

Asymmetric 

9 

101 

114 

7.9 

3337 

Unclassified 

4 

1244 

39 

12.1 

907 

Compact 

4 

15 

16 

1.8 

1153 

“Bad” 

10 

939 

459 

39.6 

2964 

“Good” 

16 

83 

93 

6.4 

5227 


Table 8. Structure-induced scatter in VLBI observables on California-Spain baseline, 
as a function of visibility amplitude, for the 16 “good” sources 


Visibility 


Scatter about centroid 


No. points 

Group delay, psec 

Phase delay, psec 

Delay rate, E-15 sec/sec 

0 
b 

1 

o 

349 

82 


24.1 

73 

0.1 -0.2 

163 

73 


11.0 

324 

0.2 -0.3 

135 

122 


11.2 

314 

0.3 -0.4 

94 

122 


7.2 

1051 

© 

1 

o 

b 

48 

109 


4.6 

1129 

0.5 -0.6 

28 

93 


3.9 

585 

b 

1 

<x> 

b 

17 

77 


3.8 

410 

00 

b 

1 

b- 

b 

16 

36 


2.2 

438 

0.8 -0.9 

5 

40 


1.4 

178 

© 

b 

1 

b 

2 

27 


0.2 

725 


41 



Table 9. Structure-induced scatter in VLB I observables on California-Australia baseline, 
as a function of visibility amplitude, for the 7 visible "good” sources 


Visibility 


Scatter about centroid 


No. points 

Group delay, psec 

Phase delay, psec 

Delay rate, E-15 sec/sec 

© 

© 

1 

© 

I-k 

876 

41 


55.3 

7 

© 

1 

p 

to 

187 

83 


24.5 

17 

0.2 -0.3 

78 

73 


10.5 

29 

p 

GO 

1 

O 

154 

68 


9.5 

52 

0 

1 

p 

Cn 

100 

35 


3.7 

129 

0.5 -0.6 

47 

39 


2.3 

47 

0.6 -0.7 

23 

25 


1.6 

76 

0 

-v) 

1 

p 

bo 

24 

61 


1.2 

13 

0.8 -0.9 

8 

79 


0.9 

6 

0.9 -1.0 

1 

30 


0.1 

107 


Table 10 
on 

. Scatter of VLBI observables for circumpolar 
California— Spain baseline, only the 16 "good” 

sources and other sources 
sources are included 

Sources 


Scatter about centroid 


Group delay, psec 

Phase delay, psec 

Delay rate, E-15 sec/sec 

Circumpolar 

Other 

97 

61 

115 

55 


7.0 2880 

5.6 2347 


Table 11. Structure-induced scatter in VLBI observables on California-Spain baseline 


for 16 

“good” sources, 

for points above a given visibility amplitude 




Scatter about centroid 



Visibility cutoff 







r 


Group delay, psec 

Phase delay, psec Delay rate, E-15 sec/sec 


0.0 

83 

93 

6.4 

5227 

0.1 

72 

93 

5.8 

5154 

0.2 

62 

94 

5.3 

4830 

0.3 

53 

92 

4.6 

4516 

0.4 

31 

80 

3.4 

3465 

0.5 

17 

62 

2.7 

2336 

0.6 

12 

47 

2.2 

1751 

0.7 

10 

32 

1.3 

1341 

0.8 

3 

30 

0.6 

903 

0.9 

2 

27 

0.2 

725 


42 


Table 12. Structure-induced scatter in VLB I observables on California-Australia baseline 
for the 7 visible “good" sources, for points above a given visibility amplitude 


Visibility cutoff 


Scatter about centroid 


No. points 

Group delay, psec 

Phase delay, psec Delay rate, E-15 sec/sec 

0.0 

135 

45 

8.8 

483 

0.1 

85 

45 

6.8 

476 

0.2 

79 

43 

4.6 

459 

0.3 

79 

40 

4.0 

430 

0.4 

62 

35 

2.4 

378 

0.5 

25 

35 

1.4 

249 

0.6 

14 

33 

1.1 

202 

0.7 

8 

38 

0.4 

126 

0.8 

2 

34 

0.2 

113 

0.9 

1 

30 

0.1 

107 


Table 13. Structure-induced scatter for 16 “good" sources on 3 baselines 


Baseline 

Length, km 


Scatter about centroid 

Group delay, psec 

Phase delay, psec Delay rate, E-15 sec/sec 

Goldstone-M ad rid 

8390 

83 

93 6.4 

Hay stack-Fort Davis 

3140 

28 

41 2.1 

Haystack-Green Bank 

840 

5 

3 0.3 


Table 14. Structure-induced scatter for all 26 sources on 3 baselines 


Scatter about centroid 

Baseline Length, km 

Group delay, psec Phase delay, psec Delay rate, E-15 sec/sec 


Golds tone-Mad rid 

8390 

569 

286 

24.4 

Haystack-Fort Davis 

3140 

104 

107 

8.1 

Haystack-Green Bank 

840 

42 

22 

2.7 



-300 -200 


Fig. 1. Plot of sampled (u ( v) points for a circumpolar 
source (0153+744) on the California— Spain baseline. 


Fig. 2. Plot of sampled (u,v) points for a lower decli- 
nation source (0711 + 356) on the California— Australia 
baseline. 
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Fig. 3. Computed visibility amplitude at 8.4 GHz for the compact double 
source 0710 + 439 on the California-Australia baseline. 
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Fig. 4. Structure-induced group delay at 8.4 GHz for the compact double 
source 0710 -I- 439 on the California-Australia baseline. 
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Fig. 8. Structure-induced group delay for the compact double source 
0153 + 744 on the California-Spain baseline. 
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Fig. 9. Structure-induced phase delay for the compact double 
0153 + 744 on the California-Spain baseline. 
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ig. 14. Structure-induced delay rate for the asymmetric source 1642 + 690 
on the California-Spain baseline. 



Fig. 15. Computed visibility amplitude for the asymmetric source 2351 + 456 
on the California— Australia baseline. 
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ig. 22. Structure-induced delay rate for the asymmetric source 2351 + 456 
on the Haystack, MA-Fort Davis, TX baseline. 
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Fig. 23. Visibility amplitudes from Figs. 15 and 19, shown on the same 

vertical scale. 
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An Apparatus for the Electrodynamic Containment 

of Charged Macroparticles 

A. Williams, R. Melbourne, L. Maleki, G. Janik, and J. Prestage 
Communications Systems Research Section 


The dynamic motion of the ions contained in the trapped 199 Hg+ frequency 
standard contributes to the stability of the standard . In order to study these dy- 
namics , a macroscopic analog of the 199 Hg+ trap is constructed. Containment of 
micron-sized particles in this trap allows direct visual observation of the particles' 
motion. Influenced by the confining fields and their own Coulomb repulsion , the 
particles can form stable arrays. 


I. Introduction 

JPL is involved in the research and development of 
new technologies which surpass the stability of standards 
such as the hydrogen maser. One of the technologies under 
development in the frequency standards laboratory (FSL) 
at JPL is the trapped mercury ion frequency standard. 

Like the cesium beam, hydrogen maser, and other 
atomic frequency sources, the trapped ion standard 
is based on an atomic transition. When an atom makes 
a transition from an energy level E\ to a lower level Ei, 


a photon of frequency v = (E\ — E^jh ( h being Planck’s 
constant) is emitted. Thus, an atomic transition creates 
a frequency source. However, the atomic energy levels are 
shifted, split, or broadened when disturbed by electric or 
magnetic fields, spin-exchange collisions, etc. Therefore, 
to obtain very stable frequency standards, it is desirable 
to protect the atoms from such offenders. This is the mo- 
tivation for holding the atoms in an electromagnetic trap 
in a very high vacuum [1]. 

This article describes the construction and operation 
of a macroparticle trap which will be used to study the 
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dynamics of charged particles confined in a trap and the 
parameters that influence the details of their motion. Since 
the trap allows the confinement of micron-sized particles, 
a direct visual observation of the dynamics in the trap is 
possible. In particular, phenomena such as cooling, crys- 
tallization [2], and evaporation of confined particles be- 
come directly observable. 

The versatility of this experimental environment per- 
mits the testing of various traps; the construction of a 
novel linear trap [3] for confinement of micron-sized parti- 
cles will enable the first study of the dynamics of charged 
particles in this trap geometry. Insight into the motion of 
charged particles confined in such a linear trap will allow 
the modeling of the dynamics of ions in the linear trapped 
199 Hg+ frequency standard. 

The magnitude of variation in the second-order 
Doppler shift comprises the largest practical limit on the 
frequency stability achievable with a trapped ion frequency 
standard [3]. Since the shift is due to the motion of ions in 
the trap, a model of the ion dynamics in the 199 Hg+ trap 
will aid the determination of parameters which minimize 
variation in the second-order Doppler shift. 

The organization of the article is as follows. First, 
in Section II, the theory of trapping charged particles in 
a radio frequency (RF) trap is discussed. The details and 
results of macroparticle trap research are described in Sec- 
tion III. Results obtained with a hyperbolic trap are given 
in Section IV, and a look ahead to other capabilities of the 
system is presented in Section V. 


II. Trapping Theory 

The type of trap that has been often employed for 
frequency-standard applications is the Paul trap, one type 
of RF trap. It consists of three hyperbolic electrodes: two 
endcaps, which are a hyperboloid of two sheets, and a ring 
electrode, which is a hyperboloid of one sheet (Fig. 1). 
The trap, when assembled with the ring between the two 
endcaps, has characteristic dimensions in cylindrical coor- 
dinates of z 0 , half the distance between the endcaps, and 
ro (usually set to be equal to \/2 • zo), half the distance 
across the ring. See Fig. 2 for a cross-sectional view of the 
assembled trap. 

To operate the trap, an AC voltage in series with a 
DC voltage is applied between the ring and the endcaps 
(Fig. 2), which produces a potential 


K ° c -^ c ” n, (^) ID 


inside the trap [2]. From this expression for the potential 
the electric field E = -V(/> is easily obtained: 


E = (-^c + ^Accosn O (2£ _ rl 

z o 


It is clear that the field intensities E z and E r (and hence 
the forces F z and F r ) vary linearly with z and r, respec- 
tively, and that the electric field E is zero at the center, 
or node (r = z = 0) of the trap. It is also seen that the 
field intensity in the z direction is twice as strong as that 
in the r direction and has opposite sign. This negative 
sign shows that when the confining fields point toward the 
node in one direction, they point away in the other. 

It may not be immediately obvious that an alternat- 
ing potential should stably confine charged particles. The 
“secret” lies in the fact that the forces toward the node 
increase with distance away from the node. Suppose that 
in one half-cycle of the potential, the force on a particle 
is away from the node (Fig. 3). At the second half of the 
cycle, the particle is situated at a point of stronger field 
than where it started. Now the force is not only directed 
toward the node, it is larger in magnitude, thus leaving the 
particle closer to the node than it started. It should now 
be clear that over one period of the trapping potential, the 
net effect is to move the particle nearer to the node. 

Stable confinement occurs in a region dependent on 
the charge-to-mass ratio of the particles, Va c> Vdc> aR d 
Q. When the system is operated with Vdc set to zero, the 
limit of stable containment is given by [2] 


^)(^)(^)-°- 9o8<o <3 > 


Thus, the charge-to-mass ratio can be determined by ex- 
ploring the boundary of the stability region, where the 
particle is no longer trapped. Either Vac or Q is held 
constant and the other varied in order to find the bound- 
ary of stable operation. This experimentally determined 
value can then be substituted into Eq. (3) in order to solve 
for the charge-to-mass ratio. 
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III. The Macroparticle Trap 

In 1959 the first visual observations of macroparticles 
in a Paul trap were made by Wuerker and his co-workers 
[2]. These investigations revealed interesting dynamic 
behavior when many particles were held in the trap — 
influenced by both the confining fields of the trap and their 
own Coulomb repulsion, the particles formed a stable array 
(Fig. 4). When the driving frequency of the trap was in- 
creased, the crystal “melted” and the particles underwent 
random motion in the trap. If the frequency was subse- 
quently decreased, the particles would once again take up 
their stable array. 

The first version of the JPL macroparticle trap 
consists of a hyperbolic electrode configuration with ro = 
1.9 cm. Alumina powder (particle size nominally 5 fim 
is loaded into a small basket of stainless wire mesh. The 
basket is suspended from a Sonalert piezoelectric beeper 
which, when activated, shakes the basket and causes a 
fine mist of powder to fall through the mesh. This beeper- 
basket assembly is suspended above the trap, and the par- 
ticles fall into the trap through a hole in the top endcap 
(Fig. 5). It is also possible to release particles from the 
basket without using the beeper by causing a discharge 
around the basket. 

The entire trapping apparatus is mounted inside a 
custom vacuum chamber which has electrical feedthroughs 
on its bottom flange, a viewport for viewing in the 
r-z plane, two illumination windows perpendicular to the 
viewport, and a vacuum port. A mechanical pump evac- 
uates the chamber to a pressure of about 5 mtorr. While 
the trap is operated primarily in this pressure regime, op- 
eration with pressures as high as 200 mtorr has also been 
carried out. The particles may be illuminated by any light 
source with sufficient power to make them visible; this par- 
ticular system has been illuminated by a 0.5-mW helium- 
neon laser, a 15-mW argon ion laser, and a 300-mW dye 
laser. A Bausch and Lomb microscope with magnification 
20 x completes the viewing apparatus. 


is also a DC voltage (V^) which can be applied to the top 
endcap in order to counteract the gravitational pull on the 
particles (Fig 2). 

With this system particles are readily trapped when 
the pressure in the vacuum chamber is lower than about 
200 mtorr. Often both positively and negatively charged 
particles are trapped. The sign of the DC potential V g 
applied to the top electrode of the trap allows the deter- 
mination of the sign of the charge of the trapped particles. 


IV. Results 

Operating the trap in this way reveals behavior simi- 
lar to that seen by earlier experimenters. Up to about 50 
particles have been trapped by this system. Often both 
positively and negatively charged particles are contained. 
In the many-particle case, a crystal formation centered on 
the node and outlining the trapping fields is seen when V g 
cancelled the weight of the particles (Fig. 6). The 60-Hz 
micromotion of the particles is seen to decrease in ampli- 
tude, as it should, when the particles are nearer to the 
node. With V g = 0, the particles are pulled by gravity 
into the bottom of the trapping potential still in their 
crystalline formation. By adjusting the trapping poten- 
tials, the crystal can be broken and reformed. The parti- 
cles have a wide range of charge- to- mass ratios, which can 
be approximately determined from the potential V g nec- 
essary to counteract the gravitational force, on the order 
of 10 -3 °C/kg. Those particles whose low charge- to- mass 
ratios cause them to locate themselves on the lower region 
of the particle cloud and can be ejected from the trap by 
varying V g . 


V. Other Capabilities of the System 

Since fi/27T is fixed at 60 Hz in this system, the charge- 
to-mass ratio of the particles can also be determined by 
measuring the maximum V A c for stable confinement and 
using this measured value in Eq. (3): 


The 5-micron alumina particles have enough static 
charge to be trapped readily by the potential generated 
with Vac « 300 volts at frequency fi = 27r60 Hz. The 
trapping potential can thus be conveniently derived from 
ordinary wall power (110 V, 60 Hz) passed through a variac 
and a step-up transformer. By adjusting the variac, the 
RF potential can be chosen to be any value between 0 and 
440 V. The DC voltage (Vdc) in series with the AC voltage 
is set to zero, and the bottom endcap is grounded. There 


— = 0.908 
m 


(1.9 cm) 2 (2 tt • 60 Hz) 2 

4 ^r 


( 4 ) 


By replacing the hyperbolic electrodes with any other 
suitably sized electrode configuration, the motion of 
charged particles in novel trap geometries can be studied 
with this system. The linear trap previously mentioned 
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has been implemented in the macroparticle system in this 
way, allowing visual study of the particle motion associated 
with the linear trapped 199 Hg + frequency standard. A 
thorough study of the dynamics of particles in the linear 
trap is currently under way. 

Since any dustlike or aerosol material with appropri- 
ate charge- to-mass ratio can be trapped in this system, a 
multitude of experiments on various substances are possi- 
ble. For example, there are plans to operate the system at 
0 °C so that ice particles can be studied; such an investiga- 
tion has significant application to atmospheric physics. A 
version of the system suitable for installation on a space- 
craft could trap cosmic dust, which could subsequently be 


irradiated with laser light for spectroscopic study of this 
material. 

The physics of many-particle systems can also 
be readily studied with use of a macroparticle trap be- 
cause the number of particles in the trap can be increased 
(by activating the beeper) or decreased (by ejecting those 
particles with extreme charge-to-mass ratios). The micro- 
motion of the particles and the form of the crystal may 
be observed as the crystal is cooled by viscous drag or 
heated by increasing Vac* Quantitative measurements on 
the phenomena of crystallization and melting as functions 
of number of particles and particle energy help to describe 
the nature of several-particle systems. 
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Fig. 4. Results of early work in macroparticle traps from 
[2], showing crystalline formations: (a) about 100 parti- 
cles viewed in the r-z plane, (b) 5 particles viewed in the 
r-6 plane, and (c) 32 particles viewed in the r-6 plane. 



Fig. 5. Diagram of the present macroparticle introduc- 
tion system, showing the beeper-basket assembly used 
for introduction through the top endcap. 



Fig. 6. An r-z photograph of about 45 alumina particles 
in the trap described in this article. 
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Spin-Lattice Relaxation and the Calculation of Gain, 
Pump Power, and Noise Temperature in Ruby 

J. R. Lyons 

Telecommunications Systems Section 

This article describes how a quantitative analysis of the dominant source of 
relaxation in ruby spin systems is used to make predictions of key maser amplifier 
parameters. The spin-lattice Hamiltonian which describes the interaction of the 
electron spins with the thermal vibrations of the surrounding lattice is obtained 
from the literature. Taking into account the vibrational anisotropy of ruby, Fermi’s 
rule is used to calculate the spin transition rates between the maser energy levels. 

The spin population rate equations are solved for the spin transition relaxation 
times, and a comparison with previous calculations is made. Predictions of ruby 
gain, inversion ratio , and noise temperature as a function of physical temperature 
are made for 8.4-GHz and 32-GHz maser pumping schemes. The theory predicts 
that ruby oriented at 90 deg will have approximately 50 percent higher gain in dB 
and slightly lower noise temperature than a 54.7-deg ruby at 32 GHz (assuming 
pump saturation). A specific calculation relating pump power to inversion ratio is 
given for a single channel of the 32-GHz reflected wave maser. 


I. Introduction 

Ruby has been the active material employed in all 
DSN masers [1], and is again being considered for use at 
32 GHz. In order to be able to predict the best operating 
point (i.e., crystal orientation, magnetic field intensity, and 
pumping scheme) available at a particular frequency and 
temperature, it is necessary to understand the relaxation 
behavior of the spins. 

The spins in ruby are Cr 3+ ions introduced as an im- 
purity in the sapphire lattice. Because the spins are lo- 
calized in the lattice, they obey classical particle statistics 


and have a Boltzmann distribution of energies at ther- 
mal equilibrium. To achieve maser operation, a spin pop- 
ulation inversion is created by pumping the appropriate 
transitions at their resonance frequencies [2]. The incom- 
ing signal is then amplified by inducing stimulated emis- 
sion between the signal transition levels. The amount of 
pump power required to maintain the population inver- 
sion depends on the rate at which the spins relax back to 
lower levels. With a knowledge of the relaxation rates, one 
may compute the corresponding inversion ratio, gain, and 
noise temperature of the ruby. Conduction losses would, 
of course, be accounted for separately. 
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The dominant spin relaxation mechanism in ruby at 
microwave frequencies and liquid He temperatures is due 
to interaction of the spins with the lattice, the so-called 
Kronig-Van Vleck mechanism [13]. Thermal vibrations 
of the AI 2 O 3 lattice induce spin transitions via spin-orbit 
coupling. A so-called magneto-elastic tensor which char- 
acterizes the coupling of the Cr 3+ ions to the lattice has 
been measured at 10 GHz [3]. Assuming the “direct” pro- 
cess of relaxation in which a single quantum of energy 
is exchanged between the spins and the lattice, Donoho 
[ 4 ] calculated the spin relaxation rates for an isolated spin 
based on the measured value of the magneto-elastic tensor. 
Based largely on Donoho’s work, the calculation of relax- 
ation rates was performed at 32 GHz for maser pumping 
schemes of interest. 

As a means of comparing theory with experiment, 
Donoho also calculated the relaxation times associated 
with the saturation and recovery of a pair of spin levels. 
The calculation is performed by solving a linear set of first- 
order differential equations involving the level populations 
to determine the characteristic times associated with the 
return to equilibrium. The relaxation times depend on 
field strength, crystal orientation, and temperature, and 
are readily compared with measured values. Varying de- 
grees of agreement between theory and experiment have 
been reported in the literature [5]. Because of the depen- 
dence of the measured relaxation times on crystal growth 
procedure, we are having measurements performed at the 
University of California, San Diego on Union Carbide ruby 
at frequencies of 9 GHz and 35 GHz. Note that the mea- 
surement of relaxation times is important not only in veri- 
fying the theory, but also because shorter relaxation times 
generally imply greater pump power requirements. 

In addition to computing relaxation times, the present 
work computes the resulting spin populations for various 
amounts of pump power. Knowing the spin populations, 
all parameters of interest, such as ruby gain and noise 
temperature, may be determined. Hence, this theory pro- 
vides a means of comparing in detail the different operat- 
ing schemes for a 32-GHz maser at various temperatures. 
Experiments are being planned to measure inversion ra- 
tio, gain, and noise temperature or ruby at the 32-GHz 
operating points of interest. 

It should be pointed out that a significant approxima- 
tion has been made in assuming that the lattice interacts 
with an isolated spin. The Cr concentration is actually 
dilute (~ 0.05 percent), but if the ions cluster or if signif- 
icant amounts of magnetic impurities or crystal imperfec- 
tions are present, the relaxation behavior of the spins can 
be greatly modified [5]. In this work it is assumed that 


spin-spin interactions do not significantly affect the spin 
relaxation rates, but experiment may prove this to be a 
poor assumption. 

This article proceeds as follows. Section II discusses 
the spin and spin-lattice Hamiltonians for ruby. Section III 
outlines the calculation of the spin-lattice transition rate. 
Section IV describes the solution of the time-dependent 
rate equations for the relaxation times and compares them 
with Donoho’s [4] results. Section V discusses the calcula- 
tion of inversion ratio, gain, pump power, and noise tem- 
perature. Conclusions are presented in Section VI. 

II. Spin and Spin-Lattice Hamiltonians 

A. Spin Hamiltonian 

The spin Hamiltonian formalism [ 6 , 13] is a semiem- 
pirical technique for describing the interaction of the elec- 
tron spin of a paramagnetic ion with the static crystal 
electric field and applied magnetic field. Evaluation of the 
spin Hamiltonian yields values for the spin energy levels 
and rates of spin transition between levels due to the ap- 
plication of an RF magnetic field. The spin-lattice Hamil- 
tonian is also a semiempirical formalism, but it describes 
the interaction of the paramagnetic ion spin with the time- 
varying crystal electric field associated with thermal lattice 
vibrations. 

The spin Hamiltonian for ruby is discussed in [ 8 ], but 
a brief review is given below for completeness. The spin 
Hamiltonians for several other potential maser materials 
have been evaluated recently [9]. 

The form of the spin Hamiltonian often reflects the 
symmetry of the surrounding crystal field. In ruby, Cr 3+ 
ions enter the AI 2 O 3 lattice substitutionally for Al 3+ . 
Each Al/Cr ion is surrounded by a distorted octahedron 
of O 2 " ions such that the local site symmetry is C 3 . The 
trigonal axis of the site, which coincides with the c axis of 
the ruby, is labeled the z axis, as shown in Fig. 1. The 
spin Hamiltonian for ruby is [10] 

H, = gPB-S + D{s]--^j ( 1 ) 

where the spectroscopic splitting factor is approximately 
isotropic with the value g = 1.98, f3 is the Bohr magneton, 
B is the applied dc magnetic field, and the crystal field 
splitting factor is D — -5.73 GHz. S z is the z component 
of the vector or spin operators, 
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S — SxX *+■ Syll “I" SzZ (2) 

where the Si are given in standard texts on quantum me- 
chanics [11]. 

The first term in Eq. (1) is the Zeeman interaction 
and the second term represents the interaction of the spin 
with the static crystal electric field via spin-orbit coupling. 
Because the ground-state orbital is a singlet, the orbital 
angular momentum is quenched [13]. Hence, the crystal 
field splitting may be said to arise from admixture of the 
spin with the orbital angular momentum of higher-lying 
orbitals. Applying second-order perturbation theory, one 
obtains a splitting term that is second order in spin, as 
shown by Eq. (1). Detailed accounts of the derivation 
of the form of Eq. (1) based on the equivalent operator 
method are given elsewhere [8]. The parameter D is deter- 
mined empirically, as a first-principles calculation is very 
difficult, perhaps because the bonding is partly ionic and 
partly covalent. 

Cr is a transition metal element with the electronic 
configuration 3d 5 4j. The Cr 3 + ion has the configuration 
3d 3 . For the free ion, L = 3 and S = 3/2, so the term 
is 4 F 3 / 2 * A qualitative and simplified energy splitting dia- 
gram for Cr in sapphire showing the sequence of degener- 
acy removing perturbations is given in Fig. 2 [13]. 

To determine the spin energy levels, each spin state is 
expressed as a sum of pure spin states, that is, quantum 
states in which spin-orbit coupling is neglected, as follows: 

I *•) = a< I + f ) + bi I + ^) + Q I “$) + di I "f ) (3) 

The eigenvalue equation 

H, | *,) = Et | (4) 

is solved for the energies E{ and eigenvectors, 1^*). Know- 
ing the eigenstates, one may calculate Wij, the rate of spin 
transitions stimulated by the time-varying magnetic field 
H i between levels i and from Fermi’s rule [2]: 

Wij = \7 2 9ij (f)\ < 1* • S\9j > I 2 (5) 

where j = gPii o/h and gij(f) is the line shape function 
as a function of frequency. Plots of energy levels versus B 


and isofrequency plots of B versus 0 for ruby are given by 
Schultz-duBois [10]. 

An important point concerning the mixing of pure 
spin states should be made. For pure spin states, the selec- 
tion rule AM = ±1 applies, where M is the spin quantum 
number equal to 3/2, 1/2, —1/2, or —3/2 for S = 3/2. 
When spin-orbit interactions are included for a Cr 3+ ion 
in a crystal, the resulting spin quantum states are a mix- 
ture of pure spin states, and as such cannot be labeled by 
a single value of M. Therefore, the AM = ±1 rule can- 
not be strictly applied. This mixing of pure spin states is 
essential to continuous wave (CW) maser operation. 

B. Spin-Lattice Hamiltonian 

So far, the time- varying crystal field arising from lat- 
tice vibrations has been neglected. To describe the effect 
lattice vibrations have on the spin states, it is convenient 
to define a spin-lattice Hamiltonian, Hsl (derivations of 
the spin-lattice Hamiltonian are given in [14, 15]. Since 
the lattice vibrations may cause a slightly anisotropic per- 
turbation of the crystal field, a reasonable form for Hsl 
is [4] 

Hsl = 'Y^dijSiSj, i,j = x,y,z ( 6 ) 

where |D|. For the one-phonon (direct) process of 

energy exchange, Van Vleck [16] assumed that Hsl was 
linear in strain. The d tensor may then be expressed as 

d{j = ^ i kyl— x,y,z (7) 

ki 

where G is a fourth-rank tensor having many of the sym- 
metry properties of the classical elastic stiffness tensor re- 
lating stress and strain in a material. The tensor e is the 
classical strain given by 

1 (du k dui \ 

where u* is the displacement of atoms in the direction a:*. 
The above form of Hsl neglects spin-spin interactions. 

Calculation of the magneto-elastic-tensor G is of the 
same order of difficulty as calculation of the spin Hamil- 
tonian parameter D. Therefore, G is determined exper- 
imentally. In [3] G was measured by applying a known 
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uniaxial stress to a ruby crystal and observing the change 
in the EPR spectrum. The spin-lattice Hamiltonian given 
by Eqs. ( 6 ) and (7) is treated as astatic perturbation to the 
energy levels computed from the spin Hamiltonian. The 
measurements were performed at 10.1 GHz at room tem- 
perature. Line shifts were in the range of 10 to 50 Gauss 
when the crystals were strained. Five crystal samples were 
employed, each having a different stress axis. The compo- 
nents of G were obtained by least-squares analysis of the 
data. The Cr concentration of the samples was thought to 
be « 0.05 percent. At this concentration, spin-spin inter- 
actions may have a non-negligible effect on the spectra, so 


the G tensor values could be somewhat in error; however, 
this possibility has not been investigated. 

The number of independent components in G can be 
reduced by symmetry arguments. The form of a fourth- 
rank tensor with C 3 symmetry is given in [17], and is shown 
to have twelve independent components. Choosing trace 
( Hsl ) — 0 , which shifts the corresponding energy lev- 
els by an insignificant constant, reduces the number of 
independent components to ten. The G tensor satisfies 
Gijkt = Gjiki = Gijiki and so may be written in Voigt 
notation by defining the indices 1 = xx, 2 = yy, 3 = zz , 
4 = yz, 5 = xz, 6 = xy. In Voigt notation G has the form 


G11 

Gl 2 

-G33/2 

G14 

— G25 

G16 

G12 

Gu 

— G33/2 

— G14 

G25 

— G16 

“(Gn + G x2 ) 

— (Gn 4 * G12) 

G33 

0 

0 

0 

G41 

-G41 

0 

G44 

G45 

G52 

"-G52 

G52 

0 

— G45 

G44 

G41 

— Gi6 

Gi6 

0 

G25 

G14 

2(^11 — G12) 


The G values (in GHz) used in [4] and in this work are the 
following: 


Gn = 

124.6 

G 4 1 

= -15.0 

G 12 = 

-35.8 

^25 

= 45.0 

G 33 = 

181.2 

^52 

= 45.0 

0 

II 

54.0 

Gi6 

= 0 

G 14 = 

-15.0 

G 45 

= 0 


The factor of h 2 from the S{Sj term is absorbed into the 
G values. 

For interactions with the AI 2 O 3 lattice, the Cr 3+ ions 
can be considered to occupy two inequivalent sites, as dis- 
cussed in [3]. These sites are related by a 2-fold rotation 
about the x axis. For the two sites all the elements of 
G have the same magnitude, but G 25 , G 52 , G 16 , and G 45 
have opposite signs. In [4] and in this work only one Cr 
site, specified by the elements of G given above, is ac- 


counted for. Computer experiments have shown that the 
second site does not lead to significantly different relax- 
ation times or inversion ratios. 

The tensor G characterizes the effect a given strain 
has on the EPR spectrum. To relate G to the thermal lat- 
tice vibrations one must know the strain associated with a 
given lattice mode and also the number of phonons present 
(i.e., the phonon occupation number). Since the phonon 
wavelengths of interest are ~ 10“ 4 cm (for a sound veloc- 
ity ~ 10 6 cm/sec), the atoms may be assumed to undergo 
displacements of equal amplitude in any unit cell. (This is 
an approximation since the different mass ions would un- 
dergo slightly different displacements, thus modifying the 
local crystal field.) Then, for phonons of wavevector k and 
polarization vector i p , the displacement may be written as 



where M is the crystal mass, c 0 is the angular frequency, 
and a and a + are the phonon annihilation and creation op- 
erators. The phonon operators arise from an analogy with 
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the quantum harmonic oscillator, as discussed in any solid 
state physics text [18]. They have the following properties: 


<p 

n *,p) 

' = \/nk,p 

4- 1 | n k,p + 1^ 

(ii) 

^k ,p 

njb, P ) 

1 = V n k,P 

n k t p 1^ 

(12) 


where n kp is the phonon occupation number for the state 
fc,p, and where subscript k is the mode wavevector and p 
is the mode polarization (two transverse and one longitu- 
dinal). From Eqs. (8) and (10), the strain becomes 

j j 2 

€ii = (jfMw) { a tp~ a ~ k ’p) ( e p.'^ +€ p,i**') c ’* r ( 13 ) 


where z, j — x , t/, z. 


where |\£,') is the state vector for level i as determined 
from Eq. (4), \n k p ) is the nth phonon occupation state for 
a phonon with wavevector k and polarization p, p(w) = 
dN/dw is the density of phonon states, as a function of fre- 
quency, and dQ is an infinitesimal solid angle. In Eq. (14), 
the perturbing Hamiltonian Hsl changes the quantum 
state from that of a spin in state i and n phonons in state 
k,p to that of a spin in state j and n + 1 phonons in state 
k,p. This corresponds to the creation of a phonon when 
a spin drops from a higher energy state to a lower energy 
state. 

From [18], the density of states for each polarization 
in a periodic solid is 

< is > 

where V is the crystal volume. Making the Debye approx- 
imation in which the velocity of sound v is a constant for 
each polarization type, the dispersion relation is simply 
w = vk. Equation (15) then becomes 


III. The Spin-Lattice Transition Rate 

In determining the G tensor, the spin-lattice Hamil- 
tonian is static since the applied strain is static. How- 
ever, in modeling the interaction of the Cr spin with the 
thermal lattice vibrations, the strain is time periodic. The 
resulting Hsl is therefore a dynamical Hamiltonian requir- 
ing the application of time-dependent perturbation theory, 
specifically Fermi’s rule. 

We are interested in lattice modes that are on “speak- 
ing terms” with the spins, that is, vibrating at frequencies 
that correspond to spin transitions. Fermi’s rule is used 
to calculate the transition probability rate, u>ij, for spin 
transitions from level i to level j due to spin-lattice in- 
teraction. Since ruby is vibrationally anisotropic, an in- 
tegration over all possible phonon directions is performed. 
The three phonon polarizations (one longitudinal and two 
transverse) and the phonon density of states are also ac- 
counted for. Applying Fermi’s rule, 





Hsl Vj, n k tP + l) 


Vcj 2 

*"> * Sv (16) 

The Debye temperature of sapphire is 980 K [19], which 
corresponds to a frequency of « 2 x 10 13 Hz, so the Debye 
approximation is assumed to be valid for the one-phonon 
relaxation process. Anharmonic crystal interactions and 
multiple phonon processes are neglected. 

Consider now the matrix element in Eq. (14). Substi- 
tuting Eqs. (6) and (7) for Hsl in Eq. (14), and separating 
spin and phonon operators, one obtains 


/ h \^ 2 

(**’"** | Hsl | *>' nk *P + *) = 

* (^ n k>P | a l t t p ~ a ~ktP | n *>P _ 

X ^ ^ G m nst(^i | J ^ j ^ k% *f* C Pj tk $ ) 

mnst 

(17) 

where m, n,s,< = x,t/, z. Applying Eqs. (11) and (12), the 
phonon matrix element becomes 


x p(u>)- 


(14) 


(nk,p a t,p- a -k,P n k,p + l) = -yfa + 1 (18) 
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For a large number of spins, n k fP may be replaced by the 
average phonon occupation number as given by a Bose- 
Einstein distribution of phonons (spin-one quasi-particles) 
at the lattice temperature T, 


( n *,p) 


avg 


1 

Huj 

e™ - 1 


(19) 


5? = 5,5, 

5| = 5,5, 

Si = 5,5, + 5,5, 


Si = SySy 

Si = Sy 5, + 5, 5y 
5| = 5, 5y + 5y 5, 


and 


( 23 ) 


Substituting Eqs. (16) through (19) into Eq. (14), one ob- 
tains 


Uij 


u 1 

32it 2 p c h j 



Lp t 1 =: 2 €p X l x 
Lp, 3 = 2 Cp^zlz 
Lp f 5 — 2 (fp^/r "1" £p,zlz) 


Lp , 2 “ ^Cp^/y 

Z/p ( 4 = 2 (Cp,yf* + £p,*/y) 

£p,6 = 2 (Cp j2? /y + Cp,y/ r) 

(24) 


x 



X (Cp,,*t +Cp,*M 



( 20 ) 


where and 'Fj have been denoted by i and j , p c is the 
crystal density, and Vfc iP is the phonon velocity for mode 
k,p. Equation (20) is Eq. (11) in [4]. 


Using the dispersion relation u> — 2irf = kvk tP , where / is 
the frequency of the spin transition between levels i and 
j, and substituting Eqs. (21) through (24) into Eq. (20) 
yields 



x 


6 


^ | Si | j)L PtV 



dQ 


(25) 


In order to simplify Eq. (20), it is convenient to use 
Voigt notation and to make some new variable definitions. Defining 
Define the direction cosines of the wavevector k as follows: 


u = 


ki_ 

k ’ 


i = x,y,z 


( 21 ) 


D,u — y ^ G UV L, 


P, v 




(26) 


Writing G in Voigt notation, the summation over m, n,s,f 
in Eq. (20) may be written as 


^ ^ G mn $t(^ | S m S n | ( £p t $kt + Cp.t^*) — 

must 


k Y, G uv (i | Si | j)l PiV (22) 


where S% and L PtV are defined as follows: 


the modulus-squared quantity in Eq. (25) can be written 
as 


Y G„.(i | Si | j)L PiV 


2 


6 

q,r = l 



Substituting Eq. (27) into Eq. (25) and interchanging the 
integral and the sum over polarization with the sum over 
q and r, Eq. (25) becomes 
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( 32 ) 


Uii ~ 2 p e h f =$- ^ S,r (’ I I j )(* I I J ) 

1 _ g «T ?,r = l 

(28) 


where 


'a h g 


Ax 


Ax 

H B F 


Ay 

= PcV 2 

Ay 

G F C 


Az 


Az 





1 





where Ax, Ay, A z are the material displacements which 
taken together specify the polarization vector. The matrix 
elements A, B , C and F, G, # are functions of the c tJ - and 
the wavevector direction cosines: 


Equation (29) is evaluated by a numerical integration over 
47 r steradians. Specifically, 


f { = f ( { }sin 9 d 9 d<f> 

Jatt Jo Jo 


(30) 


where the double integral is evaluated as a double sum. 
Numerical experimentation shows that the double sum 
converges for a summation step of AO = A <j> = 5 deg. 

The density of sapphire is 4.0 gm/cm 3 [20]. For G and 
/ specified in units of GHz, Vk tP in units of 10 5 cm/sec, and 
S 2 dimensionless, the constant in front of Eq. (28) becomes 
8.2 x 10“ 7 and has the units of sec -1 . 

In order to evaluate Eqs. (24) and (29), one needs 
the phonon velocity and polarization vector for the three 
phonon polarizations for each wavevector direction in the 
47T steradians. These are the quantities of interest in the 
so-called Christoffel equations [21], which describe elastic 
wave propagation in an anisotropic elastic medium. The 
equations are a coupled set of three equations of motion 
for volume elements of the crystal; Kittel [18] discusses the 
case of cubic symmetry. 

Wachtman et al. [22] gives the appropriate equations 
for single-crystal AI2O3, which has trigonal symmetry. 
The elastic constants, c i; - , (Voigt notation) are defined by 

6 

<r. = Yl Ci i c i ( 31 ) 

;=i 


where a is the stress tensor and e is the strain tensor. Val- 
ues for the six independent elements of c,j are determined 
at room temperature by a resonance method discussed in 
[ 22 ]. 


For the coordinate system of Fig. 1, the equations of 
motion for an elastic wave with frequency u — k • v may 
be written as 


A = c n l 2 + \ (c n - c 12 ) m 2 + c 44 n 2 + 2 ci 4 mn 
B = ^(cn -C12)/ 2 +cnm 2 +c 44 n 2 - 2 ci 4 mn 
G = c 44 (I 2 + m 2 ) + c 33 n 2 
F = C14 (/ 2 - m 2 ) + (c i3 -f C44) mn 
G - 2 c 14 /m + (c 13 + C44) In 
H = \ (cn + C12) lm + 2 c i4 /n 


where / = sin 9 cos <f>, m = sin 9 sin <j> ) and n = cos^. For 
each wavevector direction in Eq. (30), Eq. (32) is solved 
for the phonon velocity and polarization, as given by the 
eigenvalues and eigenvectors, respectively, of the matrix in 
Eq. (32). 

The B qr quantities defined in Eq. (29) need only be 
calculated once for each Cr site; the resulting values are 
stored in computer memory. For Cr site number 1 (i.e., 
for G given by Eq. 9) the matrix B has the form 


B 11 

B \2 

B 13 

B 14 

#15 

#16 

Bi 2 

B n 

B 13 

—B 14 

-B 15 

— #16 

B 13 

B13 

B 33 

0 

0 

0 

B H 

-F14 

0 

B 44 

0 

-#15 

B 15 

-Bib 

0 

0 

B 44 

#14 

B 16 

B\q 

0 

-B 15 

jB ,4 

#66 


where the independent components have the following val- 
ues: 
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11 = 32.2 

B\s 

= -0.0030 

12 = -13.9 

B 33 

= 36.6 

13 = -18.3 

B 44 

= 11.7 

14 = -3.67 

B&6 

= 23.1 

'is = -14.04 




dn <(Q 

dt 


* 

= -w O -n <(0). *= 1 4 (37) 


where n,*(<) is the instantaneous spin population of level i. 
Conservation of the number of spins requires that 


For Cr site number 2 the B qr have the same magnitude, 
but and Biq have opposite signs. For the B qr given 
above, u>ij is calculated from Eq. (28) for each field inten- 
sity, angle, transition, and temperature of interest. 

Finally, consider the frequency and temperature de- 
pendence of Uij. From Eq. (28), the temperature depen- 
dence is determined by the factor (1 — e~ h ^ kT )“ l } for 
which 


+ n 2 + n 3 + n 4 = N 


where N , the total number of spins, is constant. Equa- 
tions (37) and (38) reduce to a system of three equations, 


n = An -h 6 


1 — e 


hZ h± i 

hf > kT ^ 1 

1 , kT ^ ^ 


where 


Thus, for hf/kT 1, is proportional to tempera- 
ture, or, since the relaxation time T\ is ~ w"- 1 , T\ ~ T~ l . 
A relaxation time inversely proportional to physical tem- 
perature is a characteristic of the one-phonon relaxation 
process, and has been observed for many paramagnetic 
materials (including ruby) at liquid He temperatures [13]. 
The multiple-phonon relaxation processes exhibit relax- 
ation times with significantly different dependence on tem- 
perature, e.g., T\ oc T“ 7 (Raman process). 

The frequency dependence of u> i; * is contained in the 
factor / 3 • ( l — e~ h f l kT )~ l and also in the mixing of spin 
states. 

Also note the strong dependence of Uij on the veloc- 


ity of sound. From Eqs. (28) and (29), u>jj 


\ so if 


long relaxation times are desired, a hard material such as 
sapphire is beneficial. 


n i 

n = «2 


uj 41 - N 
b = cj 4 2 * N 
c^43 ■ N 


Ml ^21—^41 ^31— ^41 

A — CJ 12 — ^42 A 22 ^32 — ^42 

^13 — ^43 ^23 — ^43 A33 


IV. Spin-Lattice Relaxation Times 

The spin-lattice relaxation times, T \ , are of interest 
primarily because they are more accessible to measure- 
ment than the transition probability rates. The relaxation 
times are calculated by solving a set of time-dependent 
rate equations for the spin populations. The rate equa- 
tions, not including pump-induced transition rates, for the 
ith level are [2, 4] 


where 


An = “ ( w 12 + ^13 + ^14 + <^4l) 
A22 “ ~ (^21 + ^23 + ^24 + ^42) 
A33 = — (c^3i -f CJ32 + ^34 4 * W43) 
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Note that 




= 


and the c,- are the eigenvectors of A (written as column 
vectors) with the corresponding eigenvalues A;. The nor- 
malized population difference between levels i and j is 


where / = /,- — fj y as can be shown by manipulation of 
Eq. (28). 

Experimentally, the usual procedure for measuring T\ 
is to saturate a pair of levels with a strong microwave pulse 
and observe the recovery of the same or another pair of 
levels as the spins return to thermal equilibrium. For non- 
interacting spins (i.e., no cross- re taxation) the recovery is 
exponential, with T\ being identified as the time constant 
of the decay. A theoretical T x is determined by solving 
Eq. (39) for an appropriate set of initial conditions. For 
example, if levels one and two are saturated, then the ini- 
tial conditions would be 


n(t = 0) = 


h(Ni + N*) 
§ (Ni + N 3 ) 

N 3 


(45) 


where Ni is the thermal equilibrium spin population of 
level i. The Ni satisfy a Boltzmann distribution, 


Ni_ 

Nj 



(46) 


and also satisfy the conservation relation, 


Ni A N 2 + N 3 + A 4 = N (47) 


The solution to Eq. (39) is straightforward and may be 
found in [23]: 


n(<) = aiCie Al * + + ot 3 c 3 e Xzt - A l b (48) 


where 


ot i 
OC2 
a 3 


= (ci c 2 c 3 ) 


-l 


(n(0) + ^" 1 6) 


(49) 


nj - nj 

Ni - Nj 


1 + A x e t f Tl + A 2 e % I T * + A 3 e i ! T * (50) 


where the Ti = —1/A i are the relaxation times, and the 
Ai are determined from the a,- and c,*. Thus, the recovery 
of each pair of levels is characterized by three relaxation 
times. In fact, the same three relaxation times characterize 
the recovery between any pair of levels for a given initial 
condition; however, the amplitudes, A,*, will vary for the 
different transitions. 

Donoho [4] plots Ti and A,- as a function of 8 and 
transition for a frequency / = 9.3 GHz and a physical 
temperature T = 4.2 K. In [4] there are also several plots 
of T\ versus frequency (1-10 GHz) for particular angles 
and transitions. Generally it was found that only one or 
two relaxation times were important, usually the longest 
two. These calculations were repeated as a check on the 
computer code. Table 1 compares two sets of 7} and Ai 
values from [4] with the author’s calculations for an angle 
9 — 54.7 deg. The Ti from the author’s calculations are 
approximately a factor of one-half less than Donoho’s re- 
sults, which leads to better agreement between theory and 
experiment at 10 GHz [5, 24]. More troublesome, however, 
is the lack of consistent agreement among the A,- values. 
Although the cause of the disagreement is unknown, the 
disagreement is not considered significant in light of the 
consistent relaxation times computed. 

Previous comparisons of experimental and theoreti- 
cal values of Ti have generally not shown good agreement 
even for very dilute ruby (~ 0.01 percent) [5]. For exam- 
ple, the relaxation times may be several times shorter than 
the theory predicts and may not have the angular depen- 
dence displayed by Donoho’s results. However, Standley 
and Vaughan [24] made measurements on ruby grown by 
the so-called vapor phase modification to the flame-fusion 
technique, and found considerably better agreement with 
Donoho’s theory for concentrations up to 0.2 percent. It 
is thus likely that the source of the disagreement for the 
rubies not grown by the vapor phase method is depen- 
dent on the crystal growth process. If the Cr ions are 
not uniformly distributed, then cross-relaxation between 
exchange-coupled pairs or clusters is a possibility. Crystal 
imperfections and other magnetic impurities (e.g., Fe 3+ ) 
could similarly affect the relaxation behavior of the Cr 
ions. Standley and Vaughan [5] discuss these possibili- 
ties further. To account for these effects, Donoho’s theory 
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U>41 


would most likely have to be modified to include spin-spin 
interactions. Previous attempts to incorporate spin-spin 
effects have not been very successful [30]. 

Professor Sheldon Shultz at the University of Cali- 
fornia, San Diego is presently under contract with JPL 
to perform T\ measurements at 9 GHz and 35 GHz on 
our Czochralski-grown ruby. These measurements are be- 
ing made to determine the optimum Cr concentration in 
a 32-GHz traveling wave maser (TWM) as a function of 
temperature, and also to better understand the relaxation 
mechanism in ruby. Previous measurements [2] have indi- 
cated a strong dependence of inversion ratio and gain on 
Cr concentration, particularly at very low temperatures 
(e.g., ~ 2 K). This effect is a result of spin-spin interac- 
tions, and cannot be explained by Donoho’s theory. 


b = -N 


0> 42 + ^24 
V 43 + W 34 


(53) 


and 




A' = 


’-(W 12 + W 13 ) 

w 12 

w 13 

A + 

(W 12 - W 24 ) 

~(W 12 + 2 W u ) 

-W 24 


(W 13 - W 34 ) 

-W 34 

~(W 13 + 2W 34 ) 


V. Calculation of Gain, Pump Power, 
and Noise Temperature for Ruby 

Gain, pump power requirements, and noise temper- 
ature are important considerations in maser design and 
operation. In principle, each of these quantities can be 
predicted given an accurate knowledge of the spin-lattice 
transition probability rates, 

A. Inversion Ratio 

The gain in the ruby is directly proportional to the 
population difference of the inverted levels. The steady- 
state populations of the levels can be determined as a func- 
tion of pump power from the steady-state rate equations: 


hi =0 = ^2 [ w i« n j “ + Wij (rij - n,)j 

i = 1, 2, 3, 4 


j = 1 


(51) 


where Wij is the stimulated transition rate due to RF 
pumping given by Eq. (5). Applying conservation of spins, 
Eq. (51) may be reduced to a set of three equations, 


A'h = b 


(52) 


where n is given by Eq. (40), 


(54) 


where A is given by Eq. (42). The population vector n 
is determined by matrix inversion. Note that the Wij 
terms present in 6 and A' are restricted to W13, W24, W12, 
and W 34 , corresponding to the transitions that would be 
pumped in the double pumping schemes shown in Fig. 3. 
(The choice of these pumping schemes is discussed in [9].) 

It is assumed that the signal transition is far from sat- 
uration, so fFsignai can be neglected compared to W pU mp 
and u>ij. (Actually, W 8lgn?i \ could be > u>ij for some tran- 
sitions, but it is assumed this has negligible effect on the 
populations.) 

A convenient measure of the inverted population dif- 
ference is the inversion ratio, defined for the signal transi- 
tion occurring between levels i and j(i > j) as 

/ — n * ~ n j 

Nj - Ni 


where rij and rij are determined from Eq. (52). Plots of I 
versus W pump may be made with temperature, frequency, 
angle, etc. as parameters. 

Figure 4 shows / versus Wpump for / = 32 GHz, 
9 = 54.7 deg, and for a range of temperatures. Figure 5 is 
a similar family of curves for 9 = 90 deg. Note the oppo- 
site dependence of I on temperature for the two pumping 
schemes when the pumping rates are near saturation. This 
is a result of the settling of spins into the lowest energy 
state as h f/kT increases. 
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The inversion ratio corresponding to saturation of the 
pumped transitions can easily be read off of plots of I ver- 
sus Wpump- However, if pump saturation is expected, then 
the saturation inversion ratio can be determined directly 
from a single steady-state rate equation. A simple alge- 
braic equation then results for I. 

Figure 6 shows I versus temperature assuming pump 
saturation for the 8.5-GHz maser pumping scheme [1]. 
Siegman [2] defines the “optimum” and “equal” inversion 
ratios by rewriting the expression for I in terms of a 
pseudo-relaxation time, 


Tij — (ujij + u>ji) (56) 


and then either optimizing I with respect to the or 
simply setting all the T^ equal and calculating the result- 
ing I. (Siegman does not discuss the theory of spin-lattice 
relaxation quantitatively, and therefore has to make as- 
sumptions about the Note that in Fig. 6 the inversion 
ratio for saturation is close to the optimum value possible 
for this pumping scheme. Figure 6 will be compared with 
measurements being performed at JPL. 

Figures 7 and 8 show predicted values I versus tem- 
perature for two pumping schemes for a 32-GHz maser. 
Note that the assumption of pump saturation may not be 
correct for the higher pump frequencies. 

Many measurements of inversion ratio have been per- 
formed by R. Clauss et al. [25, 26] at JPL. Table 2 com- 
pares some of these measured values with predictions from 
the spin-lattice relaxation theory. In most cases, it can be 
assumed that the pump transitions were saturated. The 
agreement between measurement and theory is moderate, 
with predicted values often being 30 percent or more dif- 
ferent from the measured values. 

Inversion ratio measurements have also been per- 
formed by Moore and Neff [27] at 6 — 54.7 deg. Their 
results are shown in Fig. 9 with the theoretical value over- 
layed. The agreement between specific values varies, but 
the trend of decreasing inversion ratio with increasing sig- 
nal frequency is predicted by the theory. Note the large 
scatter in Moore and Neff’s measured values of I at fre- 
quencies below 20 GHz. 

The source of disagreement between the measured and 
predicted values of / is most likely due to spin-spin inter- 
actions, as discussed in Section IV. 


B. Gain of Ruby 

Expressions for maser gain are given in [2]. Consider- 
ing only first-order temperature-dependent terms, the gain 
in dB is 


GdB OC An s j gna i • ^signal (57) 

where An s j gna i = JAA fi i gna i and where A7V s j gna i is deter- 
mined from the Boltzmann relations. Figure 10 shows GdB 
(normalized to GdB at 4.2 K) versus temperature for the 
JPL X-band maser pumping scheme. Since hf/kT 1 
(approximately) even at T = 1.5 K, little departure from 
the expected T -1 behavior (dashed line) is seen [2]. 

For / = 32 GHz, hf/kT « 1 for T = 1.5 K, so some 
deviation of GdB from a T~ l dependence is expected. Fig- 
ure 11 shows the thermal equilibrium population difference 
(normalized to the population difference at 4.2 K) for the 
maser configurations shown in Figs. 3(a) and 3(b). Note 
the strong departure from the T’ 1 curve for the pumping 
schemes employing the 2-3 signal transition. However, the 
temperature dependence of GdB is proportional to I • AN. 
Figure 7 displays an inversion ratio that increases as T 
decreases, so the product I • AN will not deviate greatly 
from a T~ l dependence. 

Figure 12 shows normalized gain as a function of tem- 
perature for the two pumping scenarios corresponding to 
Figs. 7 and 8. Both curves are normalized to the gain at 
4.2 K for the scenario employing the 2-3 signal transition, 
and both assume pump saturation. The plots indicate that 
the 90-deg scheme (1-2 signal transition) will yield greater 
ruby gain for all temperatures in the range of 1.5 K to 
4.2 K. 

C. Pump Power 

After the pump transition rate Wij is determined for 
a given inversion ratio, the corresponding pump power can 
be calculated from Wij . To simplify the expression for Wij 
as given by Eq. (5), Siegman [2] defines the vector 

^ = (*,151*,) (58) 


where cq-j is a measure of the strength of the i,j transition. 
Equation (5) then becomes 


Wii = 77 2 g(f) 


H, 


(59) 
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The maximum value of crf ; - occurs when H\ and S are 
parallel. Siegman further defines 

Oij = | (<*.; X + \^1 Pij y + 7 ij *) (60) 


The vector components a*;, /?,j, Jij are deter mined Jxom 
the spin Hamiltonian. The optimum polarization of Hi is 
then easily found by maximizing H x * < 7 , 7 . 

Rather than outlining a general procedure for cal- 
culating pump power, a specific calculation for a single 
channel of the 32-GHz reflected wave maser (RWM) [28] 
is given. The RWM consists of 8 channels of ruby-filled 
waveguide in which both the signal and pump power prop- 
agate in the TE 10 mode. (Note that at the pump frequency 
the waveguide can support higher-order modes; here it is 
assumed that none of these is excited.) 

The geometry of a single waveguide channel is shown 
in Fig. 13. Note that the z axis is defined to be parallel to 
the dc field B , unlike Fig. 1 in which the z axis is parallel 
to the crystal c axis. Thus, the Wij vectors computed from 
the spin Hamiltonian given by Eq. (1) must be rotated 
through an angle 0 — 54.7 deg to apply to Fig. 13. 

The RWM employs the push-pull pumping scheme 
(Fig. 3b) at the double-pump angle 6 = 54.7 deg. The 
pump transitions are the 1-3 and 2-4 at 66 GHz, and the 
signal transition is the 2-3 at 32 GHz. The components 
are the following: 


p 

w 

II 

1 

o 

to 

013 = 0.303 

7i3 = -0.198 


024 — — 0.257 

/?24 = 0.236 

724 = -0.198 

(61) 

c *23 = -1.96 

/?23 = 1*96 

723 — 0.085 



Concentrating on the 1-3 pump transition, note that 


The magnetic field of the TE 10 rectangular waveguide 
mode is [29] 



x jtfx+ut) (63) 


where Eo is the electric-field amplitude, Zd — \J H oA * s 
dielectric impedance, A<* = Af rees pace/V^ is the dielectric 
wavelength, and (3 is the mode propagation constant. For 
ruby e r ~ 10, so Zd ^ 120Q and A d ~ 0.0566 in. The 
RWM guide dimensions are a = 0.10 in. and b = 0.05 in. 
[28]. From Eq. (63), H 1 is circularly polarized for 


Aj 



which has the solutions y « 0.1a and y « 0.9a. Forming 
the product H 1 * ^ 13 , it is found that the pump RF field 
is most strongly coupled to the spins at y & 0.1a. 

Neglecting losses and ruby absorption, the time- 
average power flow down the waveguide is [29] 

Pavg = / m 2 dA ( 65 ) 

cross 

section 


where H t is the transverse component of Hi and 



073 


<*13 

2 


+ jy ~ 



(62) 


Evaluating Eq. (65) and solving for Eq, one obtains 


where a ,3 « 0.30. 


Eo 


{ 4 ■ P avR • Zte 

V ab 



(67) 
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Assuming P avg = 50 mW per channel [28], an electric-field 
amplitude E 0 = 2.78 x 10 3 V/m is obtained in the guide. 

Substituting Eq. (62) into Eq. (59) and noting that 
Hi has no z component, 


pumping scheme used in the RWM, df p /df 3 « 2. A typi- 
cal broadened linewidth for the RWM is A/, = 400 MHz. 

Evaluating Eq. (70) for the above values, and noting 
that 7 = 2.2 x 10 5 m/Coulomb, 


^13 = jq q i 37 2 9(f) H i • (* - Jy) ( 6g ) 


Wl3 


( 780 sec 1 unbroadened 
^90 sec -1 broadened 


(73) 


Since Hi varies as a function of y, an average value of the From Fig. 4, the corresponding inversion ratios at T = 

modulus-squared quantity in Eq. (68) is required. Evalu- 4.5 K are 
ating the average, one obtains 


1 

a 



• ( x - jy) 



(69) 


{ 1.3 unbroadened 
0.8 broadened 


(74) 


Equation (68) averaged over the waveguide cross section 
becomes 

("'“L = (f)’ < 70 > 


These values compare favorably with the corresponding 
experimental values. Moore and Neff [27] found the maxi- 
mum inversion ratio to be I = 1.1 to 1.2. (The saturation 
value of I in Fig. 4 is I « 1.5). Shell and Neff [28] de- 
termined the inversion ratio of the RWM broadened to 
400 MHz to be / « 0.7 to 0.8. 


To evaluate Eq. (70), one needs to know the line 
shape function y(/). For an unbroadened Lorentzian line 
of width A fi, the line shape at the pump frequency is [2] 


9(fp ) « 


2 

f L 


(71) 


D. Noise Temperature 

Amplifier noise temperature T' is plotted versus phys- 
ical temperature T with the resistive loss quantity (5 as a 
parameter for the two 32-GHz pumping schemes shown in 
Figs. 7 and 8. Following Stelzreid [12], the amplifier noise 
power is 


where A fi « 60 MHz for ruby. 


Pn.amp = G kT' a B (75) 


For a line broadened by a stagger-tuned dc field and 
assuming that the pump sweep rate >> the pump level 
relaxation time, the line shape is approximately [2] 


where G is the amplifier gain and B is the bandwidth. To 
make this definition valid for hf /kT' a £ 1, define 


9(f) = 


_i — f _ AZil < f < f + AZ* 
WT 7 T j p 2 ^ ^ h ^ 2 


0 elsewhere 


(72) 


hL 



(76) 


where T a is defined by the black-body noise power 


where A f p and A/, are the pump and signal linewidths, 
and df p /df s is evaluated by differencing or by applica- 
tion of the Hellman-Feynman theorem [7]. For the 32-GHz 


P n,amp — G~ 


hfB 

vk ~ i 


(77) 
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Noise in masers is discussed thoroughly in [2]. The 
two primary noise mechanisms are spontaneous emission 
of photons by the ruby spins and conductor losses in the 
microwave circuitry. The noise power of a TWM is [2] 


Pn 


amp 



Pn(~T m ) + 



where 


Pn(T 0 ) = 


hfB 


JlL 

e*To - 1 


(79) 


and 


Q _ e 2(or m — ao)L 


( 80 ) 


where a m and a 0 are the ruby gain and forward loss coef- 
ficients, respectively, of the TWM, and L is the structure 
length. T m = \T S | is the magnitude of the spin tempera- 
ture defined by the ratio of the inverted spin populations: 

— = e“ T&i i > j (81) 

n j 

For G > 1, Eqs. (75), (78), and (79) yield a noise 
temperature 


T t hf 
a K i-/?) 


P 


erf-l 


(82) 


where (3 = ao/^m- Figures 14 and 15 show T ,f a versus T 
with /? as a parameter and assuming pump level satura- 
tion. The figures display a slightly lower T' a at a given 
physical temperature and (3 for the 90-deg orientation. 
The difference arises from the lower T m for the 90-deg 
pumping scheme. Note that at T = 1.5 K the effect of (3 
is diminished. Estimates of /? for a 32-GHz TWM are not 

I 


available at this time. For the RWM, Shell and Neff [28] 
estimate /? ~ 0.1. 


VI. Conclusions 

This article shows that by accounting for the rate of 
spin relaxation from higher energy levels, predictions of 
many basic maser parameters such as gain, pump power, 
and noise temperature can be made. For ruby, the dom- 
inant source of spin relaxation is interaction of the Cr 3 + 
ions with thermal vibrations of the AI2O3 lattice. Based 
on measurements found in the literature which character- 
ize the strength of the coupling between the spins and the 
lattice, quantitative estimates of the spin relaxation rates 
have been made. 

Relaxation time measurements are presently under- 
way at 9 GHz and 35 GHz as a check of the theory and 
to determine the best orientations and Cr concentration 
of the ruby crystal for a 32-GHz maser. Previous relax- 
ation time measurements have shown varying degrees of 
agreement with the theory, perhaps due to the presence 
of impurities or due to clustering of Cr ions. To account 
for impurities or clustering, the theory would have to be 
modified to account for spin-spin interactions. 

Comparison of predicted inversion ratios with mea- 
sured values also shows varying degrees of agreement, but 
the agreement is generally good. More detailed experimen- 
tal studies of inversion ratio and gain for various levels of 
pump power and as a function of physical temperature for 
the 54.7-deg and 90-deg ruby orientations are necessary at 
32 GHz. 

The theory predicts that at 32 GHz and assuming 
pump saturation, the gain for the 90-deg orientation (1- 
2 signal transition) should be nearly 50 percent higher in 
dB than for the 54.7-deg orientation (2-3 signal transition). 
The theory also predicts a slightly lower noise temperature 
for the 90-deg orientation. Although specific calculations 
have not been performed, it is expected that the 90-deg 
orientation will require « 50 percent greater pump power 
than the 54.7-deg orientation. 

The usefulness of the present theory remains to be 
seen. It may be found that spin-spin interactions cannot 
be neglected even for the relatively dilute ruby used in 
maser amplifiers. The theory is, however, a significant step 
towards a more complete understanding of ruby masers. 
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Table 1. Comparison of Donoho's [4] calculated values 
of Aj and T\ with those of the author for 6 = 54.7 deg, 
f = 9.3 GHz, and T = 4.2 K 


0 = 54.7 deg, transition = 2-3 


Donoho’s 

Author’s 


Calculation [4] 

Calculation 

T\ 

0.38 

0.20 

t 2 

0.22 

0.11 

t 3 

0.13 

0.07 

A: 

0.99 

1.00 


0 

0 

A 3 

0 

0 

9 = 54.7 deg, transition — 3-4 


Donoho’s 

Author’s 


Calculation [4] 

Calculation 

Ti 

0.97 

0.51 

t 2 

0.60 

0.30 

t 3 

0.22 

0.11 

A i 

0.50 

0.26 

A 2 

0.41 

0.49 

a 3 

0.09 

0.25 
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Table 2. Comparison of measured inversion ratio with theoretical 
predictions for 0.05 percent Cr ruby at 4.5 K 


Transition(s) and 
Frequency, GHz 


0, deg 

H 0> kG 

Signal 

Pump 

/opt 

/ meets 

jtheory 

54.7 

5.72 

2-3 

1-3, 2-4 

3.6 

to 

bo 

& 

2.2 



14.4 

32.9 




54.7 

5.56 

2-3 

1-3, 2-4 

3.6 

3.0 a ’* 

2.3* 



13.9 

32.0 




54.7 

6.78 

2-3 

1-3, 2-4 

3.4 

3.3 

2.0 



17.2 

28.0 




54.7 

3.7 

2-3 

1-3, 2-4 

4.2 

2.5 

3.0 



8.5 

21.9 




54.7 

6.6 

2-3 

1-3, 2-4 

3.4 

2.5 

2.0 



16.8 

37.3 




90 

5.0 

1-2 

1-4 

4.1 

2.5 

3.6 



8.5 

43.5 




90 

5.0 

1-2 

t— * 
i 

03 

03 

4.1 

3.2 

3.6 



8.5 

24.2, 19.3 




90 

6.8 

1-2 

1-3, 3-4 

3.3 

3.0 

2.7 



13.6 

33.9, 24.6 




90 

7.1 

1-2 

1-3, 3-4 

3.2 

2.9* 

2.4* 



14.2 

35.2, 25.2 




90 

8.0 

1-2 

1-4 

3.1 

-0.1 c 

2.5 



16.8 

68.3 




90 

8.0 

1-2 

1-3, 3-4 

3.1 

2.9 

2.4 



16.8 

40.3, 28.0 





a 0.075 percent Cr 
6 1.8 K 

c Pump not saturated 



x; a AXIS 


Fig. 1. Geometry of Cr 3 + site. 


3x4 



+ S-0 COUPLING ZEEMAN 

SPLITTING 

Fig. 2. Simplified energy splitting diagram for Cr in AI 2 O 3 lattice. Sequence of 
quantum mechanical perturbations applied is shown, m x n = m-fold orbital 
degeneracy and n-fold spin degeneracy. 


4 
3 
2 

1 

(a) (b) <c) 

Fig. 3. Double-pumping schemes being considered for 
ruby for a 32-GHz maser. The downward arrow is the 
32-GHz (signal) transition. 
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INVERSION RATIO, I 2! INVERSION RATIO, I 


6 



4. I versus W pump for 0 = 54.7 deg, H = 11.8 kG, 
signal = 2-3 transition = 32 GHz. 



Fig. 6. I versus T assuming saturation for 9 = 90 deg, 
H = 5 kG, signal = 1-2 transition — 8.5 GHz. 



Fig. 5. I versus W pump for 0 = 90 deg, H = 13.5 kG, 
signal = 1-2 transition = 32 GHz. 



Fig. 7. I versus T assuming saturation for 6 = 54.7 deg, 
H = 11.8 kG, signal = 2-3 transition = 32 GHz. 
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Fig. 9. Measured and theoretical values of I versus sig- 
nal frequency for ruby, 6 = 54.7 deg, T = 4.5 K. 
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TEMPERATURE, K 

o 1 2 3 4 5 Fig. 12. G,jb versus T for 54.7-deg and 90-deg pumping 

temperature, k schemes (assuming saturation). 

Fig. 11. Thermal equilibrium population difference for 
two 32-GHz pumping scenarios (normalized to popula- 
tion difference of the 54.7-deg scenario at 4.2 K). 








AMPLIFIER NOISE TEMPERATURE <TL), K 



Fig. 13. Geometry of a single channel of ruby-filled 
waveguide for 32- GHz RWM. 



ol l l I i I 

0 12 3 4 5 

PHYSICAL TEMPERATURE, K 


Fig. 14. Ta versus temperature for 32-GHz ruby maser, 
9 = 54.735 deg, signal = 2-3 transition = 32 GHz. 
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Performance Effects of Tie-Truss Modifications for a 
70-Meter Centerline Beam Waveguide Antenna 

J. J. Cucchissi 
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The elevation-axis tie truss of the 70-m antennas would have to be modified to 
accommodate a centerline beam waveguide. To accomplish this, the center section 
of the tie truss has to be altered, causing a change in the tie-truss compliance and 
affecting structural performance. Even with the center section completely removed, 
the worst-case rms pathlength error due to gravity load is increased from 0.025 to 
only 0.030 inches. Using a simple postprocessor technique, the effects of modifying 
the compliance can be predicted without resorting to multiple and costly reanalyses 
of large finite-element models on a mainframe computer. 


I. Introduction 

With the imminent implementation of beam wave- 
guide technology in the Deep Space Network, a study was 
initiated to predict the performance degradation due to 
retrofitting existing antennas in the 70-m subnet with a 
centerline or on-axis beam waveguide. Whereas the 70-m 
antenna structure was designed for a dish-mounted feed- 
cone (Fig. 1) and no allowance made for an on-axis beam 
waveguide, some structural components of the main re- 
flector and elevation wheel would obstruct the path of mi- 
crowave energy of a beam waveguide system. The first 
significant obstruction is a 56-foot tie truss (Figs. 2 and 
3). It is a space truss of triangular cross section that is 
integral with the reflector backup and elevation wheel sub- 
structures and that runs parallel to the elevation axis and 
connects the two elevation bearing castings. To allow a 


beam waveguide shroud to pass through this region, the 
center section would have to be modified or removed. 

The purpose of this study was to determine the effect 
of tie-truss stiffness on the main reflector surface distor- 
tion under gravity load. Specifically, the analyses quanti- 
fied the sensitivity of the gravity rms half-pathlength error 
to the stiffness of the tie-truss center section. Supplemen- 
tal analyses also examined changes to the lower frequency 
modes when the center section is removed completely. 

The sensitivity analyses described herein were per- 
formed using the methods of structural modification re- 
analysis [1, 2] and correlation analysis [2]. The specific 
procedures used are explained briefly; more rigorous de- 
scriptions of the theory and further applications can be 
found in the references. 


II. Methodology 

The 70-m tipping structure comprises the main 
reflector, elevation wheel, subreflector, and quadripod. 
A planar-symmetric finite-element half model of this struc- 
ture includes 6,776 finite elements (primarily axial force 
bars), 1,994 nodes, and 5,982 translational degrees of free- 
dom. Finite-element analysis and microwave-pathlength 
analysis is accomplished via the JPL-IDEAS finite-element 
analysis and design optimization program [3] on the 
UNIVAC. Sensitivity coefficients, derived from the large 
mainframe analysis model, are incorporated into PC-based 
postprocessing software that uses a combination of 
structural modification reanalysis and correlation analy- 
sis to assess the effect on performance when the cross- 
sectional areas of selected structural members are reduced 
or when the members are removed completely. Some 
postprocessor results were verified via mainframe finite- 
element processing. 


Using the calculated member distortions 


Ei = F Pj 

(i) 

Eg = F Pg 

(2) 

Eq = SF = factor * F 

(3) 


and enforcing compatibility, the N x 2 modifier matrix R 
is found by solving 

(E 0 -E s )R = Ei (4) 

A more detailed discussion of reanalysis is presented in [1], 


The model was analyzed first for gravity load and the 
forces in the members in the central region of the tie truss 
were examined (Fig. 3). Those members that carried at 
least one percent of the gravity load and would obstruct a 
centerline beam waveguide path were selected for modifi- 
cation. For this study, nine bars were selected. 


A. Structural Modification Reanalysis 

To model the effects of modifying these selected mem- 
bers, sets of self-equilibrating unit virtual loading pairs 

[1] , called indicator loads, are applied to the finite-element 
model at the terminal nodes of each of the members to 
be altered, one set per member. Static analysis is per- 
formed for each set separately and also for the y and z 
components of the gravity load. The JPL-IDEAS pro- 
gram provides: (1) the forces due to y and z gravity, and 

(2) the forces due to the indicator loadings, for each of the 
members to be altered. The following matrices then can 
be constructed: 


Pi 2x N matrix of original forces for N altered mem- 
bers, z and y gravity rows 

Ps N x N matrix of forces in N altered members for 
indicator loadings 

F iVxiV diagonal matrix of original flexibilities of 
N altered members 

SF N x N diagonal matrix of modified flexibilities of 
N altered members 


B. Correlation Analysis 

After determining the R modifier matrix, the gravity 
loading pathlength error analysis for the modified struc- 
ture can proceed. The pathlength error vectors for z- and 
y-gravity loads after modification can be assembled from 
the independent pathlength error vectors for z and y grav- 
ity before modification and the indicator loads as follows: 


Pz Py\ = \pz 0 Py o + [pi P2 • • • PN R 


(5) 


where pk = best-fit pathlength error vector for load k and 
subscripts z 0 , y 0 refer to z and y gravity response before 
modification, and 1,2,3 ,... , N refer to indicator loads. 

At any elevation angle a, the gravity pathlength error 
vector is 


where 


Pg - 


Pz Py 



( 6 ) 


and £ = sin 7 — sin a, r/ = cosy — cos a, and 7 = rigging 
angle (usually 45 deg), a = any elevation angle, 0 < a < 
90 deg. 
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So, for the modified structure 


Cgn — 


C Z 2 * * ’ CzN 

Cy2 * * ' Cy JV 


Pg = [pzo pyo] c + [pi P2 . . . Pn] R-c 


( 7 ) 


and 


Forming the sums of squares: 


C,i 
Cy 1 


RMS i 


0 


ss = p%p G = c t p£p Go c 

+ c T Pg o P«Rc 
+ c t (pJ 0 P w r) T c 
+ c T R T P£P jV Rc 


RM tv — 


rms 2 


0 • • • • • • RMSn 


(8) 


for N altered members (indicator loads), and the fourth 
term can be shown to be: 


where Pc 0 = \?z a Py 0 ] an< ^ = \pi Pi ■■■ Pn]- 


C — c t R t RM^C/;RMjvR c 


Using the notation Cij — correlation coefficient for 
pathlength error vectors pi and pj , and RM Sk = the best- 
fit rms pathlength error for vector and noting that 
Cij = p? pj/(RMSj * RMSj), the first term in Eq. (8) 
can be shown to be: 


A = c T RM zy C zy RM, y c 


where 


where 



i 

C 12 

C\3 

• • ClN 


C 12 

1 

C 23 

•• C 2 n 

Cij = 

Cl3 

£ 

CO 

1 

■■ C 3N 


C\N 

C 2N 

C3N * 

1 


RM 2 y — 


RMS Z 

0 


RMS V 


and 


C zy — 




1 


the second and third terms can be shown to be: 


B\ = c T RM zy CcivRMjvR c 
B 2 = c T ^RM 7y CgnRMjvR^ c 


where 


The first term is the contribution due to y-gravity and z- 
gravity loads and is assumed invariant. The second and 
third terms are the contributions of each indicator load 
correlated with both y- and 2 -gravity. The fourth term is 
the contribution of each indicator load correlated with the 
other indicator loads. Note that all but the first term re- 
quire the modifier matrix R calculated earlier using struc- 
tural modification reanalysis. 

The JPL-IDEAS program provides the best-fit rms 
pathlength error (RMSk) for each of the loads and the 
correlation coefficient (Cij) for all pathlength-error vector 
pairs. By substituting in these values each term is readily 
calculated. Summing the four terms and taking the square 
root yields the gravity rms pathlength error for a particular 
modification of the N selected members. 

rms =(55j = (.4 + B1 + B2 + CJ (9) 
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For other changes to the N selected members only a new 
R matrix need be calculated. Response over the antenna 
elevation range is computed by varying a from 0 to 90 deg. 


III. Natural Frequency Analysis 

Since the gravity rms pathlength error was not 
severely degraded by removing the center section of the 
tie truss, the normal modes were examined and compared 
before and after removal. Again, the JPL-IDEAS pro- 
gram was used to perform the natural frequency analyses. 
For a half- model structure, two stiffness matrix decom- 
positions are required, one with symmetric and one with 
anti-symmetric boundary restraints, to recover all allow- 
able modes for the full structure. 

Instead of being rigidly fixed at the elevation bear- 
ings, the stand-alone tipping-structure model is connected 
to linear springs simulating the lateral flexibility of the al- 
idade at the elevation bearing locations. This modeling 
provides a more realistic representation of the alidade and 
tipping structure interaction. Furthermore, since only half 
the structure is modeled, the alidade flexibility is repre- 
sented by two springs, one for symmetric and one for anti- 
symmetric boundary conditions. Independent static anal- 
yses of the alidade were performed to determine the lateral 
flexibility of the alidade at the bearings. These compli- 
ances were then incorporated into the tipping-structure 
model for normal modes analysis. 


IV. Numerical Results 

The graph in Fig. 4 shows the change in the gravity 
rms pathlength error over the antenna elevation range for 
a series of bar-area reductions. Each rms achieves a max- 
imum value at the extreme elevation angle of 0 deg; the 
rms is zero, by definition, at the prescribed rigging angle 
of 45 deg. There is little discernible difference in antenna 
performance between the original model and one where the 
tie-truss stiffness was reduced by 80 percent. Even with 
the center section removed, the worst case rms is 0.030 in. 

The extreme values plotted in Fig. 1 are listed in Ta- 
ble 1 under combined gravity rms at 0 and 90 deg, for an 
antenna rigged at 45 deg. Also included are the rms values 
for a 1-g y-gravity load, a 1-g z-gravity load, and the cor- 
relation coefficient for these two loads. The equation that 
accompanies the table shows a simple way to calculate the 
combined gravity response from the y and z components. 
These results reflect only the effect of uniformly downsizing 


each of the nine selected bars in the tie-truss center sec- 
tion by a percentage of their original area. No allowance 
is made for the reduction in gravity load due to reducing 
member areas because the weight of these members rep- 
resents only 0.5 percent of the total gravity (dead) load. 
Also, the solutions assume stiffness reduction only with 
no further optimization of the reflector backup or tie-truss 
structures to compensate for the rms increase. 

When spot-checked with mainframe finite-element 
model analyses, the postprocessor made accurate predic- 
tions as the bar areas were reduced. As the areas ap- 
proached zero, however, the predictions became unreliable; 
the (Eq— Es) matrix became singular, correctly indicating 
unstable nodes in the finite-element model. To accurately 
analyze this case, the bars and any extraneous nodes had 
to be removed from the model, and a mainframe finite- 
element analysis was performed. Those results for the 
center section removed are tabulated as the 100-percent 
area-reduction case. 

In Table 2 is a comparison of the lowest anti- 
symmetric modes, showing the effects of including accu- 
rate representations of the alidade compliance across the 
elevation bearings. The percentage of the total inertia 
about the y (roll) axis and z (yaw) axis contributed by each 
mode is listed in the table under effective modal inertia. 
The anti-symmetric modes above the first were changed 
moderately, as indicated by the frequency shifts and the 
redistribution of the effective modal inertias. However, the 
first anti-symmetric mode, which is essentially torsion of 
the quadripod at 1.28 Hz, was unchanged. This frequency 
is of particular importance because it is the lowest natural 
frequency of the 70-m antenna and, as such, is a critical 
performance constraint. To avoid excitation of the antenna 
modes, the lowest frequency must be outside the position 
loop bandwidth of the antenna drive servo system. 

When expressed as a percentage of the total inertia 
about each axis, effective modal inertias provide general 
information about the mode shape. In the coordinate sys- 
tem used for the tipping structure model (see Fig. 1), a 
value for 9 X indicates a pitch mode, for 9 y a roll mode, and 
for 9 Z a yaw mode. For example, in Table 3 anti-symmetric 
mode 2 after removal is a mixture of roll and yaw, repre- 
senting 12.1 percent of the inertia about the y axis and 5.6 
percent of the inertia about the z axis. Mode 3 is also a 
mixed mode, although predominantly yaw and represent- 
ing 23.9 percent of the inertia about the z axis. Highly 
localized vibrations or modes with a small fraction of the 
rotational inertia, such as torsion of the quadripod, will 
appear as small values in these tables; examination of the 
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eigenvector is required to determine the character or mode 
shape of these vibrations. 

Tables 3 and 4 compare the natural frequency per- 
formance of the antenna before and after the complete re- 
moval of the tie-truss center section. The analyses incorpo- 
rated appropriate alidade springs with different symmet- 
ric and anti-symmetric mode properties. Both the anti- 
symmetric (antenna roll and yaw) and the symmetric (an- 
tenna pitch) modes were unaffected by the removal. 


V. Summary 

Analysis indicates that the center section of the ele- 
vation tie truss can be removed with a predicted degra- 


dation in gravity pathlength error of 0.005 in. rms. In 
addition, the removal does not compromise the natural 
frequency performance. In the model, an alternate load 
path through an adjacent structure must have been in ef- 
fect to compensate for the tie- truss removal. Before any 
modification of the existing tie truss is recommended, this 
load path must be clearly defined and the integrity of the 
structural elements composing this path must be verified. 

The postprocessor techniques used in the study 
simplified the analysis and provided accurate and reliable 
results when compared to the mainframe finite-element 
analyses. They are also inherently self-checking. When 
the predictions became unreliable, the postprocessor cor- 
rectly indicated that an instability existed in the finite- 
element model. 
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Table 1. Stiffness reduction effects on RMS pathlength error 


Area or stiffness 
reduction, percent 

l-g y-gravity 
RMS, in. (cm) 

l-g z-gravity 
RMS, in. (cm) 

Correlation 
coefficient, C zy 

Combined gravity 0 
RMS, in. (cm) 

EL 0 deg 

EL 90 deg 

0 

0.0274 (0.0696) 

0.0354 (0.0899) 

0.1095 

0.0254 (0.0645) 

0.0210 (0.0533) 

20 

0.0275 (0.0699) 

0.0356 (0.0904) 

0.1146 

0.0255 (0.0648) 

0.0210 (0.0533) 

40 

0.0276 (0.0701) 

0.0359 (0.0912) 

0.1214 

0.0257 (0.0653) 

0.0210 (0.0533) 

60 

0.0281 (0.0714) 

0.0365 (0.0927) 

0.1303 

0.0260 (0.0660) 

0.0213 (0.0541) 

80 

0.0292 (0.0742) 

0.0375 (0.0953) 

0.1404 

0.0267 (0.0678) 

0.0220 (0.0559) 

100 

0.0315 (0.0800) 

0.0412 (0.1046) 

0.1291 

0.0294 (0.0747) 

0.0239 (0.0607) 

a RMS a = (rfRMSl + (?RMS\ + 2 nCRM S V RM S t C zy 

for a = 0 deg, 7 = 45 deg 
T) = —0.2929 
C = 0.7071 

and for a = 90 deg, 7 = 45 deg 
rj = 0.7071 
C= -0.2929 

f 




Table 2. Comparison of anti-symmetric modes with and without simulated 

alidade compliance 


With alidade compliance Without alidade compliance 


Mode 


Effective modal 


Effective modal 


Frequency, 

Hz 

inertia, 

<>v 

percent of total 

9r 

Frequency, 

Hz 

inertia, 

percent of total 

6z 

1 

1.281 

— 

1.1 

1.281 

— 

1.0 

2 

1.489 

12.1 

5.6 

1.594 

24.8 

2.7 

3 

1.719 

13.8 

23.9 

1.769 

2.2 

60.8 
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Table 3. Comparison of anti-symmetric modes before and after removal of tie-truss center section 


Before removal 


After removal 


Mode Effective modal Effective modal Mode shape 

Frequency, inertia, percent of total Frequency, inertia, percent of total 

Hz By B z Hz By e z 


1 

1.281 

_ 

1.0 

1.281 

_ 

1.1 

Yaw (Quad Torsion) 

2 

1.491 

12.7 

4.5 

1.489 

12.1 

5.6 

Roll + Yaw 

3 

1.736 

13.0 

21.5 

1.719 

13.8 

23.9 

Yaw -f Roll 


Table 4. Comparison of symmetric modes 


before and after removal of tie-truss center section 


Mode 


Before removal 


After removal 

Mode shape 

Frequency, 

Hz 

Effective modal 
inertia, percent of total 

B x 

Frequency, 

Hz 

Effective modal 
inertia, percent of total 
B x 

1 

2.389 

41.6 

2.381 

42.4 

Pitch 

2 

2.853 

10.9 

2.847 

10.5 

Pitch 

3 

3.262 

2.3 

3.255 

2.1 

Pitch 
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Fig. 1. Isometric view of the 70-m antenna showing 
coordinate system of the tipping structure. 
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This article compares the performance (bit-error rate vs. signal-to-noise ratio) 
of two different interleaving systems , block interleaving and the newer helical in- 
terleaving. Both systems are studied with and without error forecasting. Without 
error forecasting, the two systems have identical performance. When error fore- 
casting is used with shallow interleaving , helical interleaving gains, but less than 
0.05 dB , over block interleaving. For higher interleaving depth, the systems have 
almost indistinguishable performance. 


I. Introduction 

As NASA wants to receive more data from plane- 
tary missions, and as technologies like data compression 
make projects tighten their error-rate requirements, many 
missions are turning to concatenated coding schemes with 
Reed-Solomon (RS) codes as “outer” codes concatenated 
with “inner” convolutional codes, which have been used in 
the Deep Space Network for years (Fig. 1). RS decoders 
for use beginning in the mid-1990s will be installed in the 
network by February 1, 1992. 

When concatenated coding is used, symbols in RS 
words are generally “interleaved,” so that the error cor- 


recting ability of one RS word is not used up entirely by 
one or two long error bursts from the Viterbi (convolu- 
tional) decoder. Some missions planning to use RS codes 
in the Deep Space Network will have “block” interleaving 
depth five. This means that consecutive symbols in a RS 
word are separated on the channel by exactly four other 
RS symbols, one from each of the other four words in the 
block (Fig. 2). 

Recently, other interleaving schemes have been sug- 
gested, such as E. Berlekamp’s “helical interleaving” [1, 
2]. In this article, the initial analysis of this scheme is 
presented. In helical interleaving, words go into the inter- 
leaver/deinterleaver in the staggered way shown in Fig 3. 
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(Notice that if a block-interleaved code block were pasted 
onto a cylinder, the set of symbols consisting of the jth 
symbol in each word lies on a circle around the cylinder; 
if the helically interleaved words were pasted onto a cylin- 
der, the set consisting of the jth symbol in each word lies 
on a helix.) Helical interleaving is defined precisely in Sec- 
tion II. 

It is worth noticing that for a given interleaving depth, 
the two schemes yield statistically identical code perfor- 
mance. As far as any one codeword is concerned, it con- 
sists of certain RS symbols; its decoding does not depend 
on which other codewords the intervening symbols belong 
to. So why this apparently more complicated interleaving 
scheme? There are two ways in which we might expect 
helical interleaving to be different from block interleaving. 
The first is synchronization: if a synchronization marker 
is placed before or at the beginning of each codeword in a 
block-interleaved block, then several markers appear con- 
secutively at the beginning of each code block; with helical 
interleaving, the markers would be uniformly scattered. 
This should allow faster (though possibly more compli- 
cated) synchronization, but we do not address this issue 
here. The second way in which helical interleaving can be 
expected to perform differently from block interleaving is 
in error forecasting. For several years, the DSN’s planning 
for RS decoders has taken into account the slight perfor- 
mance gain available from the fact that RS codes can use 
quality information in decoding; a symbol that is believed 
questionable and flagged as an “erasure” costs only half 
an error [3]. 

The error-forecasting scheme used in this article is a 
simple but effective one: that is to create erasures in those 
words that have too many symbol errors [4] by assuming 
that errors in adjacent words continue in the undecodable 
word. Based on the above idea, there are two ways to flag 
an erasure; that is, either flag a symbol of an undecod- 
able word as an erasure if either one of the code symbols 
next to it on the channel is detected as erroneous (single- 
sided forecasting), or flag a symbol as an erasure only if 
the code symbols on both sides of it on the channel are 
detected as erroneous (double-sided forecasting). Simula- 
tions show that in all cases the more aggressive single-sided 
forecasting scheme performs better than the conservative 
double-sided forecasting scheme. Thus we choose to use 
the single-sided forecasting scheme in all our simulations 
in this article. Since the Viterbi decoder, which decodes 
the inner convolutional code, creates errors in bursts, pre- 
vious analysis [5] shows that in the case of block interleav- 
ing, using this simple error- forecasting strategy allows a 
gain of about 0.04 dB. 


For block interleaved data, words go into the inter- 
leaver as shown in Fig. 2. Each interleaving block is sep- 
arated from others, and all possible error forecasting can 
be done within the same block. Thus a finite buffer can 
do all error forecasting. In the light of this finite buffer 
feature, error forecasting with redecoding to each inter- 
leaving block was selected, disregarding the order of de- 
coding of codewords in each block. That is, when a code- 
word in a block is decoded successfully, erasure informa- 
tion is generated for both of its adjacent words in the block. 
After the first round of decoding in a block, the previ- 
ously undecoded codewords are then redecoded using the 
additional erasure information. This process goes on until 
no more undecodable words can be recovered. For heli- 
cally interleaved data, words go into the interleaver in the 
staggered way shown in Fig. 3. Unlike the block inter- 
leaving scheme in which each codeword is “related” only 
to words in the same block, each codeword in the helical 
interleaving scheme is theoretically “related” to all words 
before and after it, and this makes error forecasting with 
redecoding impractical, if not impossible, since it would 
require a buffer with infinite memory. Thus in helical 
interleaving one does error forecasting without redecoding. 
That is, when a codeword is successfully decoded, erasure 
information is generated to those symbols (of other not yet 
decoded codewords) that are adjacent to the previously er- 
roneous symbols of the decoded codeword. Each codeword 
is then decoded in order, first without using the erasure 
information. If the codeword is decodable, the decoder 
goes on to decode the next codeword. If not, the code- 
word is decoded once again using the erasure information 
generated from previously decoded words. Whether the 
codeword is decodable or not (with erasure information), 
the decoder goes on to decode the next codeword. 

For deep interleaving, both block and helical inter- 
leaving give nearly identical error forecasting capability 
because both schemes are essentially “ideal interleaving” 
schemes which allow nearly statistically independent RS 
symbols. But for shallow helical interleaving (e.g., depth 2, 
which is used by Galileo), a word with too many symbol 
errors to decode is preceded by a part of one word and part 
of another. On the one hand, this means that a Viterbi 
decoder error burst is less likely to keep both different 
codewords from decoding, and so the error forecasting ca- 
pability should be enhanced. On the other hand, only 
some of the symbols in a given codeword are preceded by 
symbols in a codeword that is decoded before it. For the 
exact schemes we compare in Section III, the trade-off be- 
tween these effects depends on the interleaving depths and 
the inner convolutional codes. For interleaving depth 2, 
helical interleaving gives a slight performance gain over 
block interleaving with both the (7,1/2) and (15,1/4) inner 
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convolutional codes. For interleaving depth 4 or 8, helical 
interleaving is slightly better or slightly worse than block 
interleaving, depending on whether the (15,1/4) code or 
the (7,1/2) code is used as the inner code. 


II. Helical Interleaving 

In this section, mathematical definitions of the inter- 
leaving schemes are included for completeness, but the 
reader is advised to consider Figs. 2 and 3 to illustrate 
these definitions. 

Definition 1. A block code with n code symbols 
per codeword is block interleaved to depth d if the code 
symbols sent on the channel are packed into codewords 
in such a way that the code symbol that follows (on the 
channel) the j th symbol of the mth codeword, m < d, is 
the j th code symbol of the (m + l)th codeword, and the 
code symbol that follows the j th code symbol of the dth 
codeword, j < n, is the ( j + l)th code symbol of the first 
codeword, and the code symbol following the nth code 
symbol of the dth codeword is the first symbol of the first 
codeword in the next block. 

Definition 2. A block code with n code symbols 
per codeword is helically interleaved to depth d, where 
dr = n — 1 and r is any integer, if code symbols are packed 
into codewords in such a way that the code symbol that 
follows (on the channel) the jth code symbol of the mth 
word, m < d, is the ( j — r)th code symbol of the (ra+ l)th 
codeword; if j — r < 0, then this is the ( j — r + n)th code 
symbol of the mth codeword of the “previous” group. The 
j th code symbol of the dth codeword is followed by the 
(j + (d — l)r)th code symbol of the first codeword of this 
group if j < r and by the ( j — r)th code symbol of the first 
codeword of the next group if j > r. 

Helical interleaving can be defined for codeword 
length n and interleaving depth d as long as n and d are 
relatively prime. While this more complicated definition 
for interleaving depths 4 and 8 was needed in the simula- 
tions, technical definitions will not be written here. The 
interested reader can find them in [1]. 

Helical interleaving was introduced in 1982 [1, 2] by 
E. Berlekamp. It has since been successfully implemented 
in several hardware decoders. Figure 3 depicts a helical 
interleaver for a code of length 4 interleaved to depth 3. As 
discussed in Section I, use of a helical interleaving scheme 


in general does not enhance the gain performance of a 
concatenated coding system. 

For many applications, it is better than block inter- 
leaving because of synchronization and interleaving delay. 
Helical interleaving facilitates synchronization in the fol- 
lowing way: in block interleaving, the j th symbols of two 
adjacent codewords in a block are adjacent to each other 
in the channel; in helical interleaving, the jth symbols of 
two adjacent codewords are separated from each other in 
the channel by n — 1 symbols in a block-interleaved sys- 
tem. If a synchronization marker is placed before or at the 
beginning of each codeword in a block-interleaved block, 
several markers appear consecutively at the beginning of 
each code block in the channel symbol streams, and the re- 
ceiver can acquire only synchronization modulo nd. With 
helical interleaving, the markers are uniformly scattered, 
and this allows synchronization modulo n. The difference 
in interleaving delay is that the end-to-end delay of a block 
interleaver is 2nd, exclusive of the channel delay, whereas 
the delay of a helical interleaver is only nd . 


III. Simulation 

The performance of the concatenated coding systems 
described in this section were obtained by simulation. The 
inner codes used were the standard (7,1/2) convolutional 
code, which was simulated with a software Viterbi decoder, 
and Galileo’s experimental (15,1/4) code, which was de- 
coded using the Communications Systems Research Sec- 
tion’s long constraint length “Little Viterbi Decoder” built 
for Advanced Systems. The outer codes used were the 
standard (255,223) RS code, which has been adopted for 
use by a number of present and future missions, and the 
proposed (1023,959) RS code for deep-space missions of 
the future. 

Figures 4 through 11 give the SNR performance com- 
parisons of various concatenated schemes using block inter- 
leaving and helical interleaving. Figures 4 and 5 show the 
performance curves of the concatenated coding schemes 
using the (7,1/2) convolutional code as inner code, and the 
(255,223) RS code as outer code, without and with error 
forecasting respectively. Similarly, Figs. 6 and 7 show the 
performance curves of the concatenated coding schemes 
with the (7,1/2) convolutional code and the (1023,959) 
RS code. Figures 8 and 9 show the performance curves of 
the concatenated schemes using the (15,1/4) convolutional 
code and the (255,223) RS code. And finally, Figs. 10 and 
11 show the performance curves of the concatenated cod- 
ing schemes using the (15,1/4) convolutional code and the 
(1023,959) RS code. 
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IV. Conclusion 

It is observed from Figs. 4 through 11 that, as pointed 
out in Section I, when no error- forecasting strategy is used, 
both helical interleaving and block interleaving perform 
identically for all interleaving depths. When error forecast- 
ing is used, helical interleaving gains less than 0.05 dB at 
shallow interleaving. For deeper interleaving (e.g., depth 4 
or depth 8), helical interleaving is slightly worse or slightly 
better than block interleaving, depending on whether the 
(7,1/2) convolutional code or the (15,1/4) convolutional 


code is used as inner code. The difference in performance 
between helical and block interleaving for higher depths 
might be attributed to the fact that the (15,1/4) code pro- 
duces longer bursts than the (7,1/2) code. 

On the basis of these results, it is recommended that 
helical interleaving be considered for possible use only in 
particular situations, and not for general use. As usual, 
the real moral seems to be that one should allow greater 
interleaving depth, whatever interleaving scheme is used. 
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Fig. 1. Concatenated coding system. 
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Fig. 2. Example of block interleaving. 
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Fig. 3. Example of helical interleaving 
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Fig. 4. (7,1/2) convolutional code and (255,223) 
Reed-Solomon code; no error forecasting. 
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Fig. 5. (7,1/2) convolutional code and (255,223) 
Reed-Solomon code with error forecasting. 
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Fig. 6. (7,1/2) convolutional code and (1023,959) 
Reed-Solomon code; no error forecasting. 



1.7 1.8 1.9 2.0 2.1 2.2 2.3 

SIGNAL-TO-NOISE RATIO 


Fig. 7. (7,1/2) convolutional code and (1023,959) 
Reed-Solomon code with error forecasting. 
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Fig. 9. (15,1/4) convolutional code (255,223) 
Reed-Solomon code with error forecasting. 


Fig. 8. (15,1/4) convolutional code and (255,223) 
Reed-Solomon code; no error forecasting. 





Fig. 10. (15,1/4) convolutional code (1023,959) 
Reed-Solomon code; no error forecasting. 
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On the Decoder Error Probability of Linear Codes 

K.-M. Cheung 

Communications Systems Research Section 

In this article , by using coding and combinatorial techniques , an approximate 
formula for the weight distribution of decodable words of most linear block codes 
is evaluated. This formula is then used to give an approximate expression for the 
decoder error probability Pe(u) of linear block codes , given that an error pattern 
of weight u has occurred . It is shown that Pe(u) approaches the constant Q as u 
gets large , where Q is the probability that a completely random error pattern will 
cause decoder error. 


I. Introduction 

Coding is used in a digital communication system to 
detect and correct errors introduced in the data stream by 
channel noise. An important parameter to evaluate the 
performance of a code is its decoder error probability. Let 
C be a linear (n,Jfc,(f) code over GF(q ), and C 1 be its 
(n, n — ifc, d x ) dual code. Let t be the number of errors the 
code is designed to correct. Let G be the generator matrix 
of C. Let A u denote the number of codewords of weight it, 
and D u denote the number of decodable words of weight 
u. Decodable words are defined as all words lying within 
distance t from a codeword. If the decoder is assumed to be 
a bounded distance decoder, then the weight distribution 
for the decodable words can be used to find the decoder 
error probability of the code. 

When a codeword c € C is transmitted over a com- 
munication channel, channel noise may corrupt the trans- 


mitted signals. As a result, the receiver gets a corrupted 
version of the transmitted codeword c + e, where e is an 
error pattern of some weight u. If it < f, then a bounded 
distance decoder on the receiver’s end detects and corrects 
the error e, and recovers c. If u > J, the decoder fails, and 
it either 

(1) Detects the presence of the error pattern e, but is 
unable to correct it, or 

(2) Misinterprets the received pattern c+e for some other 
codeword c' if the received pattern falls into the radius 
t of the Hamming sphere around c'. 

Case (2) is, in most cases, more serious than case (1). 
This can occur when an error pattern e is of weight u > 
d — t. As pointed out in [1] and [2], if all error patterns of 
weight u are equally probable, the decoder error probabil- 


ity 


ity given that an error pattern of weight u occurs, denoted 
by Pe(u), is given by the following expression: 

PeW = (>- W (1) 

In this article, by using combinatorial and coding tech* 
niques, an approximate formula for the weight distribution 
of decodable words for most linear block codes is evaluated. 
This formula together with Eq. (1) gives an approximate 
formula for the decoder error probability Pe(u) for most 
linear block codes. It is also shown that 


Pb{u) 


large u 


Q 


where Q is the probability that a completely random error 
pattern will cause decoder error. That is, 


Q ( l ~ QVnft) 


q- r Vn(t) 


( 2 ) 


where r = n — k is the code’s redundancy and V n (t) = 
ELo (?)(^ — 1)* is the volume of a Hamming sphere of 
radius t. 

II. Mathematical Preliminaries 

In this section, combinatorial and coding techniques 
required to derive the results in later sections are intro- 
duced. These techniques are similar to those used in [6] to 
obtain the weight distribution of linear block codes. 


A. Principle of Inclusion and Exclusion 

Let x be a set of N objects, and P(l), P(2), . . . , P(u) 
be a set of u properties. Let N{i \ , * 2 » • • • > *r ) be the number 
of objects with properties P(i\), P(h)i • • • , P(i r )- The 
number of objects jV(0) with none of the properties is given 
by [3]: 

N(9) = N - N(i) + £ N(h , i 3 ) + . . . + (-l) r 

* 1*1 <*3 

X X ^(*1 1 *2 » ■ • • i * r ) • • • 

»1<*2 <*r 

+ (-l) u AT(l,2, 3,. (3) 


There are u + 1 terms on the right-hand side of Eq. (3), 
with the Oth term representing the total number of objects 
in x- If all terms beyond the rth term on the right-hand 
side of Eq. (3) are ignored, then the resulting truncated 
sum is an upper bound when r is even, or a lower bound if 
r is odd. Thus the maximum error magnitude introduced 
in the inclusion and exclusion formula by ignoring all terms 
beyond the rth term does not exceed the magnitude of the 
(r+ l)th term. This fact will be used later to upper bound 
the magnitude of the errors of the approximate weight dis- 
tribution formula. 


B. Facts on Coding Theory 

A linear (n,fc,d) code over GF(q) can be generated 
by a k x n generator matrix G, not necessarily unique and 
such that rank(G) = it. Let / be the maximum number 
such that no / or fewer columns of G add to zero. Then 


/ < k 


( 4 ) 


Equality in Eq. (4) is achieved in the case of maximum 
distance separable (MDS) codes. Since G is the parity- 
check matrix of G 1 , l = d 1 — 1. Let col tl , col, 3 , . . . , col i j 
be any j particular columns of G, j < l < k. It is obvious 
that there exists a k x n generator matrix G' of C and a 
k x k nonsingular matrix K such that 


G' = KG 


( 5 ) 


and coli lt col,* 3 , . . . , col^ of G f form a k x j submatrix 

of the form This fact guarantees that given any 

pattern of j symbols on the i\ th, i* 2 th, . . ., ij th coordinates, 
the number of codewords with the j-symbol pattern on 
the i\th, « 2 th, .. ., ij th coordinates equals q k ~i for j < L 
This fact is important in the next section to evaluate the 
cardinalities of some sets of decodable words. 

III. Derivation of Formulae 

Let D be the set of decodable words of G. Let d be a 
decodable word with Hamming weight u, u > n—l. Let the 
coordinates of dbe indexed by {0, 1, . . . , n— 1}. Then d has 
v zeros (v < /), where v = n - u. Let V be a set of v coor- 
dinates, \V\ = v . Let {i \ , 1 * 2 , • , ij} Q {0, 1, . . . , n— 1} — V 

be a set of j coordinates. Define S(ii,*2i • • • » *j) — {d : 


105 


d £ D and d. has zeros in V U {»i, *2 , . . - , ij }}. A decod- 
able word d e S(ii, 1 * 2 , •••>*;) always has at least v + j 
zeros. For 0 < j < l — v, the number of zeros in the de- 
codable words of S(i\ , 1 * 2 , • • • » ij) is less than or equal to 
L Now, since all words lying within the Hamming spheres 
(with volume VJi(f)) that surround codewords are decod- 
able words, there are V n (t) disjoint cosets that contain de- 
codable words. Each coset can be constructed by adding a 
coset leader a (Hamming weight of a < <) to each codeword 
in C. Thus from the discussion in Section II. B, for each 
of the V n (t) different coset leaders (each corresponding to 
a coset), there are codewords in C which, when 

added to the coset leader, give decodable words with zeros 
in the v + j coordinates. The number of decodable words 
in S(i \ , * 2 , . . . , ij ) is then given by 

|5(»'i,«2, •••,*>)! = q k - V - j V n (t) 

for 0 < j < l — v (6) 


|S(*i»i 2> • • • > J j)l = Q U ~ J 

for n — v — t -f 1 < j < n — v (8) 


In the cases for 0 < j < / — v } n — v — d + t + 1 < 
j < n — v — t, and n ~ v — t + 1 < j < n — w, the set 
i*l, i 2 , ... , ij can be chosen arbitrarily from a set of u = 
n — v coordinates. Thus for every choice of j , there are 
(p sets S(*i, t 2 , . . . , ij). By the principle of inclusion and 
exclusion, the number of decodable words with exactly v 
zeros in V, which is denoted by D' v , is: 

0V = \sm + (- 1 ) E + • • • + (-!) r 

•1 

X E |5(tl,i 2l ...,»r)| 

t‘l<*3< -<*r 


p or / — v + 1 < j < n-v-d+t , the number of zeros in 
the decodable words of S(ii,i*2> • • • >ij) exceeds /, and ap- 
parently there is no simple expression for |S(*i,i 2 i •••>*;) I- 
For n — v — d + t + 1 < j < n - v - t, the number 
of zeros in a decodable word is greater than or equal to 
n _ d -f t + 1, but less than or equal to n — t. Thus any 
decodable word in S(t’i , i 2> • > ij) has weight less than or 

equal to d — t - 1. It is not hard to see that the elements 
of S(n,i 2 , cannot be decoded into a codeword of 

weight other than Q. Therefore, S(*i,*2» • • • > ij) contains 
all words of weight less than or equal to t in the coordi- 
nates {0, 1, ...n — 1} — (V U {ti,*2> • • • >*;})• Thus, 


+ ... + (— l) n V S(i\ f *2> * • • > in— v) 



n-v-d+t 

E 


j=l-v+l 


(-1 y 


*1<*3< <ij 


n — v — t 

+ e (- 1 y 

j =n-v-d+t+l 



V 

i 



v)i=E(“ j i ) < *- 1) ‘ 

1=0 ^ ' 

ioxn — v — d + t + \<j<n — v — t (7) 


For n-v-t + 1 < j <n-v, since j is greater than or equal 
ton-v-t + l, the number of zeros v -h j is greater than 
or equal to n - t -h 1. Therefore, the number of nonzero 
components is less than t. Thus, all words with zeros on 
KU{n,i 2 ,..., ij } are decodable and thus 


+ E (-tf (“)«“■'■ ( 9 ) 

j=n-v-t + l V - // 

If all the terms beyond the l-v-l terms are ignored in the 
above inclusion and exclusion formula, Eq. (9) is reduced 
to 


jy v = '^(-l + (io) 

j - 0 
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where 


for u > max{n — /, - /) - 1}. 


Ei = (-l ) , - v ( l *^Q k -'Vn(t) 

+ e e 

;=/-v+l *!<...<*> 


and l^i | < (j“ v )g* ^(t) (from the discussion in Section 
II- A). If E 2 = E; U =,-v(-l y(j)9*- v -*Vn(t) is added and 
subtracted from Eq. (10), one has 


D V = ^zr-Vni*) + El + El (11) 


If (,“„)« > G-“+i). th at is - if « > **(« -0-1, £2 is a 

sum of terms with alternate signs and descending magni- 
tude. Then \E 2 \ < (,“J?* _ 'K.(t). Thus 


D ’v = {j -^r-Wt) + E ( 12 ) 


where E — Ei + E 2 and \E\ < 2 (,” v )q k ~ , V„(t). iy v can 
thus be approximated by V n (t), and the goodness 

of approximation depends on how small the ratio R = 
E/[(q— l) u 9 - (” - *lK,(t)] is. By using the upper bound on 
|£|, an upper bound on this ratio is given by 


R< 


(?- i) u 


(13) 


Strictly speaking, the derivation of Eq. (14) is valid 
only for u > max{n — /, ^-(n — /) — 1}. However, it is 
observed that in most cases Eq. (14) is also a close ap- 
proximation to D u for u considerably smaller than n — / 
(as in the case of Reed-Solomon codes). The upper bound 
of R derived above has a denominator term (q — l) u and 
this indicates that this approximation formula is good for 
nonbinary linear codes, and is not useful for binary linear 
codes. The looseness of this approximation for binary lin- 
ear codes is best illustrated by extended binary codes that 
have only even weights. In the case of binary primitive 
codes, Kasami et al. [4] generalized Sidel’nikov’s approach 
[5] and showed that the weights of most binary primitive 
codes have approximate binomial distribution. 

Cheung [6] later showed this is also true for nonbinary 
codes. It is conjectured in this article that the approximate 
Eq. (14) for the weight distribution of decodable words is 
also good for binary primitive codes. For nonbinary linear 

codes, the upper bound on R shows that the approxima- 
tion in Eq. (14) is particularly good for codes with large 
alphabet sets. The upper bound on R for the (31,15,17) 
Reed-Solomon code over GF( 32) is given in Table 1. The 
weight distribution of decodable words and its approxima- 
tion (using Eq. 14) of the (31,15,17) Reed-Solomon code 
are given in Table 2. 

Given the approximate formula of D u , an approxi- 
mate decoder error probability Pe(u) is obtained by sub- 
stituting Eq. (14) into Eq. (1). It is observed that Pe(u) 
approximates the constant Q = q~ r V n (t) as u gets large, 
where Q is the probability that a completely random error 
pattern will cause decoder error. An upper bound of R 
given by Eq. (13) shows that Pe(u) approaches Q “nearly 
exponentially” (for nonbinary codes) as u increases. 


Since v < /, there are (”) = (”) ways to choose v zeros 
from {0,1,.. . ,n — 1}. Then D u can be approximated by 
the following expression: 


D u = E ^v«9- (n - fc) (”)(«- 1)*V«(«) ( 14 ) 

|V|=n— u V 


IV. Conclusion 

In this article, by using the inclusion and exclusion 
principle, an approximate formula for the weight distri- 
bution of decodable words of most linear block codes is 
derived. The decoder error probability Pe(u), which is a 
function of D U1 is then shown to approach the constant 
Q as u gets large, where Q is the probability that a com- 
pletely random error pattern will cause decoder error. 


107 



References 


[1] R.J. McEliece and L. Swanson, “On the Decoder Error Probability for Reed- 
Solomon Codes,” IEEE Trans . Inform . Theory , vol. IT-32, pp. 701-703, 1986. 

[2] K. Cheung, “More on the Decoder Error Probability of Reed- Solomon Codes,” 
to appear in IEEE Trans. Inform. Theory. 

[3] R. Stanley, Enumerative Combinatorics , Vol. /, Monterey, California: 
Wadsworth and Brooks-Cole, 1986. 

[4] T. Kasami, T. Fujiwara, and S. Lin, “An Approximation to the Weight Distri- 
bution of Binary Linear Codes,” IEEE Trans. Inform. Theory , vol. IT-31, no. 
6, pp. 769-780, November 1985. 

[5] V. M. SidePnikov, “Weight Spectrum of Binary Bose-Chaudhuri-Hocquenghem 
Codes,” Probl. Peredachi Inform ., vol. 7, no. 1, pp. 14-22, January-March, 
1971. 

[6] K. Cheung, “The Weight Distribution and Randomness of Linear Codes,” TDA 
Progress Report 42-97, vol. January-March 1989, Jet Propulsion Laboratory, 
Pasadena, California, pp. 208-215, May 15, 1989. 


fHKtfl p/a 

N90-12794 

TDA Progress Report 42-98 August 15, 1989 

l,i 


■>3 

%/f 


Automated Monitor and Control for 
Deep Space Network Subsystems 

P. Smyth 

Communications Systems Research Section 


This article will consider the problem of automating monitor and control loops 
for Deep Space Network (DSN) subsystems. The purpose of the article is to give 
an overview of currently available automation techniques. In particular, the article 
will consider the use of standard numerical models, knowledge-based systems, and 
neural networks. Among the conclusions is argued that none of these techniques 
alone possess sufficient generality to deal with the demands imposed by the DSN 
environment. However , the article will show that schemes that integrate the bet- 
ter aspects of each approach and are referenced to a formal system model show 
considerable promise — although such an integrated technology is not yet available 
for implementation. The article should be of benefit to readers interested in any 
aspects of DSN automation as it highlights the advantages and dangers of currently 
available approaches to the automation problem. The article frequently refers to 
the receiver subsystem since this work was largely motivated by experience in de- 
veloping an automated monitor and control loop for the advanced receiver. 


I. Introduction 

Consider the problem of designing an automated mon- 
itor and control loop for a DSN subsystem. This arti- 
cle will refer to the “system” as the system being con- 
trolled and the “loop” as the hardware or software (or 
human) component which is used to monitor the system 
and control its behavior. For example, a receiver is a sub- 
system which is normally controlled in a relatively man- 
ual manner by human operators. This article will argue 
that reliance on purely manual control techniques will not 
be sufficient to cope with future subsystem technologies 


and an increasingly complex DSN ground station envi- 
ronment. Autonomous or semiautonomous control loops 
will be necessary. This article will address some basic is- 
sues regarding automated monitor and control loops: how 
should such loops be designed and what general princi- 
ples should be used? Specifically, the problem is defined 
in a very general manner using a state-space model and 
evaluating current technologies such as knowledge- based 
systems and neural networks in this context. The monitor 
and control problem can be decomposed into four basic 
subfunctions: obtain sensor data, estimate system param- 
eters from the data, make decisions based on the estimated 
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Table 1. Upper bound on R 
for the (31.15.17) RS code 


u 

R 

16 

2.74943144e— 024 

17 

1.50775270e— 024 

18 

4.37734673e— 025 

19 

8.94296586e— 026 

20 

1. 442413 89e— 026 

21 

1.95423782e— 027 

22 

2.31146419e— 028 

23 

2.44993596e— 029 

24 

2.37090920e— 030 

25 

2.12446863e— 031 

26 

1.78181347e— 032 

27 

1.41082139e— 033 

28 

1.06190837e— 034 

29 

7.64152968e— 036 

30 

5.28215447e— 037 

31 

3.52143591e— 038 


Table 2. Weight distribution and its approximation 
for the number of decodable words of the (31,15,17) 
RS code over GF(32) 


u 

Du (exact) 

D'u (approximate) 

9 

1.998e+014 

2.999e+015 

10 

5.232e+016 

2.045e+017 

11 

6.348e+018 

1.210e-f 019 

12 

4.817e+020 

6.253e+020 

13 

2.613e+022 

2.833e+022 

14 

1.113e+024 

1.129e-f 024 

15 

3.970e+025 

3.968e+025 

16 

1.231e+027 

1.230e+027 

17 

3.364e+028 

3.364e+028 

18 

8.111e+029 

8.111e+029 

19 

1.721e+031 

1.721e+031 

20 

3.200e+032 

3.200e+032 

21 

5.196e+033 

5.196e+033 

22 

7.322e+034 

7.322e+034 

23 

8.822e+035 

8.882e+035 

24 

9.178e+036 

9.178e+036 

25 

7.967e+037 

7.967e+037 

26 

5.699e+038 

5.699e+038 

27 

3.272e+039 

3.272e+039 

28 

1.449e+040 

1.449e+040 

29 

4.647e+040 

4.647e+040 

30 

9.603e+040 

9.603e+040 

31 

9.603e+040 

9.603e+040 


parameters, and implement the decision by changing sys- 
tem inputs. Previously reported work on station automa- 
tion [1, 2] has focused on the automation of operational 
procedures, namely the automation of data collection and 
control implementation, the first and fourth subfunctions 
listed above. Traditionally, the estimation and decision- 
making subfunctions have been performed manually by 
human operators. These are precisely the subfunctions 
on which this article focuses, i.e., this is what is referred 
to as automated monitor and control. 


II. A Statement of the Problem 

A simple general model of a DSN subsystem which 
one wishes to control is shown in Fig. 1. I 0 (i) and 0 0 (t) 
are defined as the observable inputs and outputs respec- 
tively, and I d (t) and Oo(t) as the nonobservable inputs and 
outputs, where f denotes time. H is the open-loop transfer 
function of the system, i.e., 0(f) = H (/(f)). 

In addition, there is possibly an a priori system model 
M . The rules of the game comprise of being given / 0 (f), 
0 Q (t ), and M at time f, and trying to effect a particular 
input/output transfer function H at some time f -f 6t by 
changing whatever subset of the inputs / 0 (f) one is allowed 
to control. This is a very general description, and, for ex- 
ample, could equally well describe an implementation of 
an adaptive control loop using classical control theory, or 
a human trying to perform real-time fault-diagnosis in or- 
der to restore normal operating system behavior. A key 
point of this article is that there is a need to be concerned 
with all such levels of control. The need for modeling very 
different control loops of this kind in an integrated manner 
will be identified. The function of most DSN subsystems 
is to deliver or process signals or data subject to some cri- 
teria. Hence, the automated loop must be able to perform 
a variety of functions such as performance optimization 
and fault detection in order to satisfy overall system per- 
formance criteria. The article will return to this point 
in much greater detail later. This requirement will entail 
that the loop has a model , in some form, of the system it 
is trying to control. 

Figure 2 shows the general feedback control scheme, 
where G represents the control-loop function. For exam- 
ple, if the system can be modeled accurately by closed- 
form analytic solutions, then G may be implemented as 
a set of equations such as a phase-locked loop (PLL) or 
other such receiver structure. However, at the level of in- 
terest here (namely, entire subsytems such as the receiver), 
analytic closed-form solutions are not practical and it is 


traditional to close the loop manually, i.e., G is a human 
operator. 

For two reasons the usual engineering analysis ap- 
proach will not work directly on this problem: first, the 
open-loop system transfer function (call it H) is often non- 
linear and too complex to model accurately. Secondly, the 
transfer function may vary over time and be a function of 
the input variables. Remember that this is an entire sub- 
system involving the interaction of hardware and software 
components. Hence, the system will have a set of states 
(possibly infinite). Let the state of the system at time t 
be 5(f). The transfer function H is then H (5(f), /(f)). 
The only time when H is known exactly may be for a sub- 
set of states Sn ) which might be thought of as “normal 
operating states.” This is precisely the case for which lit- 
tle or no external feedback ( G in Fig. 2) is required since 
internal feedback in the subsystem components (for ex- 
ample in the PLL component) will automatically produce 
the desired output. One is interested of course in the other 
cases (5(f) £ 5„), namely, spurious system behavior due 
to “non-normal” inputs, faulty components, etc. 

As a concrete example, consider the receiver subsys- 
tem. Receivers are designed to work specifically on a 
known class of modulated signals at specific predicted fre- 
quencies. Hence, S n corresponds to the case where the 
signal is at the correct position in the spectrum, of the cor- 
rect modulation format, and the noise is bounded within 
expected characteristics. As is well-known, achieving opti- 
mal receiver performance even when all these assumptions 
are true is a very non-trivial design problem. Hence, sys- 
tem designers can hardly be expected to account for the 
large class of spurious system states that may arise dur- 
ing field operations, e.g., new noise characteristics, in-band 
radio-frequency interference (RFI), problems in other sta- 
tion subsystems, or system components which behave er- 
roneously or fail. As previously mentioned, it has been 
traditional in field environments such as the DSN to have 
the human operator detect and correct spurious system 
behavior using his/her knowledge of the system and the 
environment. A point of this article is that manual con- 
trol of this nature is neither feasible nor desirable in the 
long run. 

There are two primary considerations: where to place 
G relative to existing subsystems and human operators 
and, secondly, how to implement G. This article will pri- 
marily focus on implementation issues, but will first briefly 
consider the issue of where to place an automated loop 
component relative to the existing DSN subsystem envi- 
ronments. 
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III. Basic Architectural Considerations 

Consider once more Fig. 2. There are a number of pos- 
sible avenues one could pursue. One could close the loop 
within the box (i.e., internally in the system as shown in 
Fig. 3) and let the the operators function as they currently 
do, relating inputs to outputs with little information as to 
the inner state of the system. This is feasible and reduces 
interface problems, but may actually introduce a new set 
of difficulties for the operator: the behavior of the internal 
control loop may make it very difficult for the operator to 
understand the behavior of the system, i.e., it is even more 
difficult to operate than previous “open-loop” systems. 

An alternative approach is to replace the operator en- 
tirely and close the loop completely autonomously (Fig. 4). 
This approach is probably not necessary for DSN sub- 
systems, since it is widely acknowledged that in any au- 
tomated control system operating in uncertain environ- 
ments, human intervention may always be necessary in 
emergency situations. A good example is that of autopilot 
control schemes in commercial airliners, where emergency 
situations require that the human pilots break the auto- 
matic control loop. 

This paradigm of semiautonomous control (Fig. 5) 
introduces the important component of situation assess- 
ment. In airliners, for example, for the pilot to break the 
automated loop and effectively take over control, he must 
be able to assess the current state of the system or the 
context in which the automated control loop was acting. 
This issue requires that the automated system must in 
some sense be able to communicate at any time the cur- 
rent state of both the system and the control loop to a 
human operator. Hence, semiautonomous control entails 
that the control loop contains a high-level model of the 
system it is trying to control. 

Another important practical point is that in order for 
the control loop G to be effective it must have access to all 
of the relevant system data and be able to control system 
parameters, i.e., a comprehensive sensory and effectory 
system must be in place. This is a fundamental require- 
ment. Subsystems should be designed initially such that 
future hardware and software links to external automated 
control loops can be easily and cleanly implemented, e.g., 
hardware ports, software accessibility, etc. This holds true 
even if the automated loop is built into the originally de- 
signed system, since at the next level of automation these 
control loops will need to communicate with higher-level 
systems such as a monitor and control system for an entire 
ground station. 


IV. A State-Space Model for 

Automated Monitor and Control 

Consider in more detail the nature of the control loop. 
One can decompose the problem into two parts: deter- 
mining the state of the system at time t (estimation) and 
effecting some control action on the system to achieve de- 
sired behavior at time t + St (decision). Monitoring the 
state of the system is a necessary prerequisite to controlling 
it, since controller actions need to be context dependent, 
i.e., what is done to control the system depends on the 
state of the system and the input variables (both observ- 
ables and unobservables). This is the crux of the problem. 

As described earlier one can define S(t) as the state of 
the system at any time t , and S n is a set of states defined as 
“normal operating behavior.” Hence one can define the set 
of states Sn as system states indicating spurious behavior 
and potentially requiring corrective control action, where 
Sn is the complement of S n over the entire set of possible 
system states. The purpose of the monitor/controller G 
can be restated as follows: 

(1) At any given time t obtain a state estimate S of the 
true state S(t) of the system, where 

s = f (/.(<), 0.(0, M, h(S, I 0 , Oo, o) (1) 

where I 0 (t) and 0 Q (t) are the observable inputs and 
outputs, M is an a priori model of the system, and 
h(-) is a finite history of previous estimated states and 
input and output variables. 

(2) If S G Sfi choose a control action to achieve a specific 
set of outputs at time t + 6t in the future, by setting 


Io(t + St) = g(s{t), M, I 0 (t), h(S, I 0 , 0.,0) (2) 

While this state estimation and transformation model 
is not the only possible model, and is somewhat of a sim- 
plification, it serves the purpose of giving a good insight 
as to the basic nature of the problem. The first step, that 
of estimating the state of the system, is quite challeng- 
ing in itself since the state-space model will necessarily be 
quite complex for any realistic system. This step must be 
solved before proceeding to step (2), since typically control 
actions are quite sensitive to context information, i.e., the 
state estimate 5. For example, an operator’s actions on 
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a receiver depend directly on what the operator perceives 
the current state of the receiver to be, e.g., whether or not 
the PLL is in lock or not. 

The state-space model is useful because of its gener- 
ality. In this framework a phase-locked loop is a very well- 
specified state estimation and control scheme operating on 
a signal in real-time. At the other end of the spectrum, 
the human operator is implementing a much more heuris- 
tic and adaptive estimation and control scheme at a much 
slower rate, what one might term “human real time.” One 
is interested in control loops that operate between these 
two extremes. The remainder of the article will examine 
current technologies in the context of this state estima- 
tion and control model. The possible approaches can be 
divided into three broad paradigms (standard numeric al- 
gorithms, knowledge-based systems, and neural networks) 
not because they form a well-defined taxonomy for model- 
ing autonomous control systems, but simply because of the 
widespread interest in techniques of this nature and their 
potential applications. A goal of the article is to clarify to 
which types of problems each approach is best suited, and 
in the process debunk some common myths. 


V. Standard Analysis and Algorithms 

Engineers like to model systems using differential or 
difference equations and then, by analysis of the mathe- 
matical model, design an estimator and controller subject 
to cost and performance criteria. The quality of the result- 
ing control loop hinges critically on the accuracy of the as- 
sumed model, e.g., whether the real noise statistics fit the 
parametric model or not. By weakening the assumptions 
(e.g., by using non-parametric statistical models) one can 
improve the generality of the model but it then becomes 
correspondingly more difficult to design the control loop. 

Some systems are simply impossible to model accu- 
rately using closed-form mathematical solutions alone. It 
is a key point of this article that DSN subsystems fall into 
this category. The advanced receiver subsystem, for ex- 
ample, will be a complex mixture of interacting hardware 
and software components. Clearly, at the local compo- 
nent level, existing analytical models will be quite accu- 
rate, e.g., the transfer function for an accumulator or an 
automatic gain control (AGC) circuit. However, one is in- 
terested in the global picture, how the components interact 
together to determine system behavior. Direct modeling 
via equations is hardly practical. For instance, how does 
one combine information such as U E S /N 0 = 9.5 dB” and 
“the board for calculating FFTs is down”? 


The key point to note is that variables of interest must 
be abstracted to the appropriate level of representation. 
The issue of representation is critical. The model should 
necessarily entail different levels of abstraction, from rel- 
atively high-level system configuration concepts (such as 
a board being down) down to quantitative real-time pa- 
rameter estimates (such as the value of E 8 /Nq). One can 
conclude that exact mathematical models have an impor- 
tant role in control loops but are not sufficiently powerful 
in terms of representation to meet requirements. 


VI. Knowledge-Based Systems 

The term “knowledge-based system” is used rather 
loosely to describe any system where knowledge about the 
system is explicitly represented in symbolic form, as op- 
posed to being embedded in a low-level algorithm. There 
is an increasing realization that any system which intends 
to communicate with other agents, be they other pieces 
of software or humans, needs to be able to abstract its 
information to the appropriate level. In addition, explicit 
knowledge representation buys the ability to modify the 
control system, to update it, etc., in an efficient man- 
ner. The potential value of portable, easily- modifiable 
code that can be changed relatively effortlessly is enor- 
mous. 

A. Rule-Based Expert Systems 

Rule-based expert systems have proven to be very 
popular in recent years as solutions to the type of con- 
trol problem discussed so far. Essentially, they attempt 
to replace or augment the human operator by a program 
which contains a representation of his expertise as a set 
of rules. The question will be “Is this a good modeling 
tool for solving the DSN automated monitor and control 
problem ?” 

First consider the basic characteristics of this 
approach. The rules represent situation-action pairs or 
symptom-diagnosis pairs based on the expert’s experi- 
ence. Generally this approach works well in domains where 
experts do well and the domain is not particularly well- 
understood at a basic theoretical level, e.g., in many med- 
ical applications where basic cause and effect relationships 
are not known. Conversely, a rule-based system degrades 
rapidly in performance when faced with novel problems 
(symptom combinations not encoded as rules) which the 
expert has never encountered before. In artificial intel- 
ligence, this type of rule-based representation, based on 
experiential knowledge alone, is referred to as a “shal- 
low” model, since it is based on simple pattern-matching 
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without reference to any deeper causal domain theory. 
It is important to point out that impressive results have 
been achieved in real applications using rule-based systems 
and that the technology is available for implementation at 
present and is well-understood. What this article is try- 
ing to do is to identify the limitations of the approach in 
relation to a particular monitor and control problem. 

With reference to Section IV, which defined the mon- 
itor and control problem as a two-step procedure based on 
the estimation function /( ) and decision function g( ), 
a rule-based implementation can be considered as follows: 
effectively the functionality of /( ) and g( ) are combined 
together into single condition-action pairs or rules. Hence 
the rules define a heuristic mapping from system parame- 
ters to control actions. In this sense the state of the system 
is never formally evaluated. The consequent implications 
for semiautonomous control and situation assessment are 
obvious, i.e., it will be very difficult for an external agent 
to take over system control in an effective manner. 

Rule-based systems are not particularly appropriate 
in domains where any of the following hold true: 

(1) The experts have limited experiential knowledge (are 
not really experts) or, indeed, there are no experts. 
This latter problem arose in the advanced receiver 
expert system project since the system has not been 
fielded in an operations environment. 

(2) There exists a well-defined domain theory, at least at 
the component level, i.e., well-defined functional and 
structural properties in the system exist which are 
fundamental to problem diagnosis. 

(3) The external environment is subject to change and, 
hence, novel problems are likely. 

(4) Time and procedural information are critical to rea- 
soning about the domain, e.g., in setting up a link to 
a spacecraft there is a definite sequence of procedu- 
ral steps which occur. Rules are very inefficient at 
representing procedural knowledge. 

System designers are not the types of experts that 
rule-based expert systems are intended to model. Their 
knowledge consists of causal reasoning based on first prin- 
ciples rather than the experiential situation-action pairs of 
an experienced system operator. While it is true that one 
can recast basic domain knowledge in the form of rules, it 
tends to obscure the underlying causal domain theory, i.e., 
it is not an efficient way to represent this type of knowl- 
edge. For a new system like the advanced receiver there 


is no experiential or heuristic knowledge available. This 
problem of designing “expert” systems without experts 
is quite challenging — this article will show later how the 
notions of model-based reasoning and learning algorithms 
may provide useful solutions. A practical approach might 
be to initially field the system with some form of intelli- 
gent model-based control loop which includes a capability 
for rule-based representation, and, as expertise develops, 
integrate experiential rules into the loop. 

Given the above observations, it is clear that rule- 
based expert systems are not a very good overall model 
for the problem discussed here. It is important to keep 
in mind that rule-based systems were originally developed 
primarily by the medical community as a consultative off- 
line tool, i.e., a program with which a human interacted, 
answered queries, and received recommendations. Expert 
systems were never explicitly designed to operate in active, 
real-time modes in domains where expertise is scarce and 
reasoning from first principles is essential. 

Nonetheless it is important to counter the common 
myth that rule-based models require dedicated hardware 
and software components to work and operate separately 
from “ordinary” software on special purpose machines. 
Recent trends in the application of rule-based systems 
have been very much focused on integrating the rule-based 
approach with standard software, e.g., rule bases can be 
called by standard procedural programs, integrated with 
common databases, etc. At the present time most com- 
mercial shells are available in languages such as C. Hence, 
the paradigm of a standard procedural control loop that 
can call a function to perform rule-based reasoning (to de- 
termine the value of a global context variable, perhaps), 
which is written entirely in a relatively low-level language 
such as C, and which can be executed by a single-chip mi- 
croprocessor may be a much more useful paradigm than 
relying on the conventional rule-based approach as the ba- 
sic model for control loops. 

B. Model-Based Reasoning 

Researchers in artificial intelligence have recognized 
the limitations of purely rule-based systems as outlined 
above and have consequently moved on to investigating 
more powerful knowledge representation techniques. 
Atkinson [3] gives a useful overview of recent AI appli- 
cations for monitor and control in the aerospace domain. 
This is of interest since, as with the DSN subsystem prob- 
lem, monitor and control of aerospace systems involves a 
large amount of temporal, procedural, and first-principles 
knowledge. The notion of model-based reasoning is loosely 
based on the idea that if a reasoning system knows the 
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function and structure of each system component (at some 
appropriate level of representation), and knows how these 
components are connected together, then by observing cer- 
tain input and output system parameters it should be able 
to reason from first-principles to infer system behavior, 
e.g., detect the most likely cause of a fault. This is an 
attractive paradigm for the domain discussed here. Re- 
search in this area has primarily been domain-dependent, 
with perhaps the most common domain being that of trou- 
bleshooting electrical circuits [4]. A particularly notable 
application (in a different domain) is the LES system [5] 
which monitors and controls the critical loading of liquid 
oxygen onto the space shuttle at Kennedy Space Center 
prior to launch — the system has been able to perform fault 
detection and isolation in 10 sec as compared to the 20 min 
taken by manual methods. 

Model-based reasoning is quite different from the rule- 
based expert system approach and has been associated 
with the term “deep knowledge” or common-sense reason- 
ing. Unfortunately, from an applications point of view, 
there are many unsolved problems which remain the sub- 
ject of ongoing research. For example, the computational 
requirements can scale exponentially with the number of 
components under consideration. Another problem is that 
there has been very little applied research in this area and 
hence there is very little practical feedback available for the 
purposes of determining the appropriate level of granular- 
ity for such models. For example, for a signal-processing 
subsystem such as a receiver, should the model entail first- 
principles knowledge regarding linear systems theory, or is 
this too detailed? A useful overview on the current state 
of research in applying model-based reasoning to commu- 
nication systems is given in [6]. 

However, despite the unresolved issues, it is impor- 
tant to realize that a priori knowledge of DSN subsystems 
often falls into this model-based category, i.e., well-defined 
component models of function and structure. With the ad- 
vanced receiver, for example, this is very much the case, 
but with the scarcity of research results and applications 
in this area (none at all in the communication systems do- 
main to the author’s knowledge) it was not possible to ap- 
ply these ideas directly to the problem at hand within the 
resources available. One can conclude that model-based 
reasoning appears promising as a useful modeling tool for 
the DSN domain and worthy of further attention. 

C. Artificial Intelligence and Decision Theory 

There has recently been a considerable amount of 
cross-disciplinary work relating AI with decision analysis 
and decision theory. Decision theory and analysis grew 


out of statistical decision theory and has developed a con- 
siderable basis of theoretical principles and practical tech- 
niques for modeling choice and decision in uncertain en- 
vironments. Applications of this technique have typically 
been for decisions in economic and military environments 
where decision theory can render decision-making a formal 
statistical modeling problem rather than a subjective pro- 
cedure, e.g., an oil company may wish to decide whether or 
not to commit resources to drilling an oil well in a particu- 
lar location. More recently, this work has been applied to 
problems which are typically within the AI domain, e.g., 
fault diagnosis or reasoning with uncertainty. The key 
shift in focus has been the notion of using decision the- 
ory as a theoretical model for decision by an autonomous 
agent, rather than using it to analyze specific human deci- 
sion scenarios. In a sense, one might say that this involves 
decision synthesis rather than decision analysis. 

Whether or not decision theory can be applied to the 
DSN monitor and control problem remains an open ques- 
tion. As with model-based reasoning, research on this 
topic is still in relative infancy. Theoretically, since the 
monitor and control loop discussed in this article involves 
a decision element, there can be no question that decision 
theory is an appropriate formal model for an autonomous 
decision agent. There are two key benefits that could be 
gained from using this approach: 

(1) Decision analysis emphasizes and provides many tools 
for the initial structuring and modeling of the prob- 
lem at hand, e.g., the use of influence diagrams, a 
technique for structuring and identifying causal mod- 
els. Using these techniques would formalize the initial 
model-building part of the control loop design process. 

(2) The use of statistical decision theory as the basis for a 
rational decision agent is particularly appealing. For 
example, in an advanced receiver it may be possible to 
combine several fast Fourier transform (FFT) power 
spectra in a noncoherent manner. Deciding how many 
spectra to average over involves a tradeoff between the 
variance of the spectral estimate and the time taken 
to perform the estimate, i.e., the time delay in the 
control loop. Clearly this decision is very much con- 
text-dependent on parameters such as carrier- to- noise 
ratio and estimated carrier Doppler rate. For exam- 
ple, it is more important to minimize the time-delay 
when one is not in lock than when one is in lock. Util- 
ity is defined as a quantitative measure of the overall 
benefit or cost which accrues to the decision-maker 
from a particular action/outcome pair, i.e., if he/she 
decides on an action a,-, and event ej subsequently oc- 
curs, u(aj, ej) is a defined utility for this pair. For the 
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purposes here, utility might be defined as the vari- 
ance/time tradeoff associated with each action/event 
pair. Rational decision-making corresponds to choos- 
ing the action a* which maximizes one’s expected 
utility with respect to the probability space defined 
over relevant events, i.e., a probability space involving 
con text- valued variables such as in-lock/not-in-lock. 

However, as with standard algorithms and analysis, 
decision theoretic models alone are not sufficiently rich as 
a knowledge representation scheme to offer a stand-alone 
solution. Nonetheless, there is a strong case for their in- 
clusion as a solution component and their integration with 
more conventional approaches. 


VII. Neural Networks 

What is a neural network? One view (from an algo- 
rithm designer’s perspective) is that they are highly dis- 
tributed, nonlinear models for implementing function es- 
timation, i.e., given input and output samples of a “black 
box” the neural network can reproduce the functionality 
of the “black box” in general. This view may be somewhat 
narrow, but generally speaking, optimization, adaptation, 
pattern recognition, etc., can all be suitably recast into the 
notion of function estimation. Hence, for example, neural 
networks can form the basis for computationally power- 
ful, nonparametric statistical estimation tools. Indeed, it 
seems that early successful applications of this technology 
may be found in computationally intensive pattern recog- 
nition and signal processing problems which have proven 
to be difficult to model via traditional serial, linear, non- 
adaptive models, e.g., speech recognition, optical character 
recognition, nonlinear signal processing, adaptive control, 
etc. Typically, it seems that the less one knows about 
the problem solution a priori (e.g., the less one under- 
stands about the noise characteristics of the problem), the 
more favorably a neural solution will compare to standard 
alternatives such as Markov models, Bayesian classifiers, 
etc. The availability of dedicated hardware within the next 
decade will be a major factor in determining the future of 
neural network technology. 

How do neural networks relate to the monitor and 
control problem in this article? One could certainly envi- 
sion possible applications in terms of real-time estimation 
and adaptive control, particularly at the signal level. But 
as with each of the approaches we have considered earlier, a 
neural network would not be sufficient as an overall system 
model. In particular, given the current understanding of 
the neural approach, there is no way to incorporate a priori 


knowledge into the network. In terms of the earlier two- 
step model, the decision function g( ) is learned entirely by 
the network itself, with no reference to the a priori system 
model M. Neural networks are essentially “black boxes” 
in that they learn and implement input/output mappings 
efficiently — but their representation of this mapping is im- 
plicitly embedded in a distributed manner in terms of 
learned weights. Hence, for example, one cannot incor- 
porate existing functional and structural knowledge about 
the subsystem discussed here into the network. One can 
conclude however that as a subcomponent of the overall 
model, say as a nonparametric signal/RFI classifier, the 
neural network approach shows significant promise pro- 
vided the hardware becomes available. 


VIII. Summary 

The need for automating monitor and control func- 
tionality in DSN subsystems is becoming more apparent 
as technology (sensors and computational resources) im- 
proves to the extent that operators can no longer be ex- 
pected to assimilate system data and make decisions in 
real time. This article has shown that in whatever man- 
ner one may implement such a monitor and control loop, 
it clearly needs to include a model of the system, whether 
it is implicitly embedded in equations or rules, or more 
formally represented in an explicit manner. The argu- 
ment for a formal representation model revolves around 
three facts: first, it renders the loop design problem more 
tractable as it provides the reference point from which to 
integrate different numeric and symbolic approaches. Sec- 
ondly, it increases both the life cycle and the flexibility 
of the designed software (making it easier to transfer pro- 
totype systems of this nature from a research laboratory 
to field implementation). Thirdly, it facilitates communi- 
cation between the subsystem and higher authorities, be 
they human or automated. 

It is apparent that the currently available paradigms 
for implementing control loops each have distinct advan- 
tages, yet, for the purposes described in this article, none 
are sufficiently powerful as an overall system model. It 
may be conjectured from these observations that success- 
ful monitor and control systems in the DSN domain will 
incorporate models which integrate the more appropriate 
aspects of each approach. One could envision a model- 
based representation that is run using decision-theoretic 
principles and that controls and calls lower-level analytic 
models and neural-based estimation schemes in a context- 
dependent manner. For example, in the receiver, compu- 
tational resources could initially be focused on acquisition 
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(using an FFT to locate peaks in the spectrum [7]), while, 
once lock is achieved, resources could be allocated to RFI 
monitoring and parameter optimization. 

In the short term, the issue of model-based techniques 
and their integration with standard numerical models ap- 
pears worthy of further investigation and application. For 
example, the use of object-oriented programming tech- 
niques appears to have considerable potential for imple- 
menting models of this nature. Rule-based expert systems 
will have their role to play, but the system implementer is 
advised to be wary if the domain primarily involves tempo- 
ral, procedural, or first-principles knowledge. Using rule- 
based systems as a component of an overall solution seems 
like a much more reasonable approach, particularly given 
that (as discussed earlier) rule-based code can now be writ- 
ten in “low-level” languages such as C and consequently 
embedded in standard software. Neural network technol- 
ogy may be applicable to specific subcomponents of the 
estimation and control problem, particularly once dedi- 
cated hardware becomes available. Hybrid systems, which 
combine aspects of each approach appear potentially to 
be the most promising avenue, but little research and/or 
applications has been carried out in this arena, and hence 
they appear to be more of a long-term prospect. 

A final point worth noting concerns that of learning 
algorithms, i.e., systems which can improve their perfor- 
mance over time. A little thought will convince the reader 
that, in this domain, learning may proceed much more ef- 


ficiently and effectively if it is referenced to an initial sys- 
tem model. A very general viewpoint is that in modeling 
any system one can introduce an a priori bias (e.g., a set 
of assumptions) into the model, as in Bayesian statistics. 
A strong bias will only help the model if it is accurate, 
whereas very weak biases will make the learning problem 
much more difficult. A case in point for automated mon- 
itor and control of man-made systems is that the a priori 
bias, in the form of a functional and structural component 
model M, is necessarily correct and accurate. Hence, in 
principle, it can only improve the quality of the overall 
control loop and any learning componenttherein. 


IX. Conclusion 

Many real-time monitor and control applications will 
not be sufficiently well-modeled by rule-bases, neural net- 
works, or standard algorithms on their own. This article 
reviewed the current state-of-the-art in this area and saw 
that more powerful representational models are in the off- 
ing. One concludes that an effective modeling technique 
would be one which integrated the better aspects of each 
approach by referencing them to an explicit functional and 
structural model of the system. Off-the-shelf solutions of 
this nature are not currently available. Hence, until such 
techniques are available, system developers should choose 
their monitor and control models carefully and recognize 
the limitations of each particular technology. 
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Fig. 5. Semiautonomous control loop. 
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Frame Synchronization Methods Based on 
Channel Symbol Measurements 
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The current DSN frame synchronization procedure is based on monitoring the 
decoded bit stream for the appearance of a sync marker sequence that is transmit- 
ted once every data frame. This article explores the possibility of obtaining frame 
synchronization by processing the raw received channel symbols rather than the 
decoded bits. Performance results are derived for three channel symbol sync meth- 
ods, and these are compared with results for decoded bit sync methods reported 
elsewhere. It is shown that each class of methods has advantages or disadvantages 
under different assumptions on the frame length , the global acquisition strategy, 
and the desired measure of acquisition timeliness. 

It is shown that the sync statistics based on decoded bits are superior to the 
statistics based on channel symbols , if the desired operating region utilizes a prob- 
ability of miss many orders of magnitude higher than the probability of false alarm. 
This operating point is applicable for very large frame lengths and minimal frame- 
to-frame verification strategy. On the other hand , the statistics based on channel 
symbols are superior if the desired operating point has a miss probability only a 
few orders of magnitude greater than the false alarm probability. This happens for 
small frames or when frame-to-frame verifications are required. Among the three 
channel symbol methods examined, the squared-distance statistic offers the best 
performance in the range of normal signal-to-noise ratios, but it degrades more 
rapidly than the correlation statistic or the hard-limited symbol discrepancy count 
statistic when the signal-to-noise ratio is extremely low. 
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I. Introduction 

The current DSN frame synchronization procedure is 
based on transmitting from the spacecraft a convolution- 
ally encoded 32-bit sync marker sequence at the beginning 
of every data frame consisting of around 10 4 bits. On the 
ground, the decoded bit stream is monitored for a 32-bit 
window of agreements with the marker, and a likely sync 
location is identified by comparing the number of disagree- 
ments with a preselected threshold. In addition, a frame- 
to-frame verification strategy is employed to definitively 
declare sync acquisition or sync loss. 

This article explores the possibility of obtaining frame 
synchronization by processing the raw received channel 
symbols rather than the decoded bits. Performance re- 
sults are derived for three channel symbol sync methods, 
and these are compared with results for decoded bit sync 
methods reported elsewhere [1, 2]. It is shown that each 
class of methods has advantages or disadvantages under 
different assumption on the frame length, the global ac- 
quisition strategy, and the desired measure of acquisition 
timeliness. 


II. Various Possible Synchronization 
Methods 

A. General Framework for a Wide Class of 
Synchronization Methods 

The DSN’s current method of frame synchronization 
can be viewed as a prototype of a fairly general type of 
synchronization scheme. There are three basic levels to 
this scheme. First, a statistic x is measured at all possible 
locations of the frame marker, and the observed values 
of x are used to identify likely locations of the marker. 
Second, values of x obtained at all possible locations within 
a frame and for different frames are correlated for one or 
more frames to verify and select the most likely marker 
location. Third, the statistic x is monitored from frame to 
frame at the selected marker location to verify continued 
sync. 

This three-level synchronization model can be com- 
pletely general if the decisions at all three levels can depend 
in an arbitrary way on all the observed data. In practice, 
it is usually assumed for ease of implementation that the 
three levels of decisions are mostly decoupled. This article 


considers only the first two levels, with primary emphasis 
on the first level. 

Ideally, the statistic x should be a sufficient statistic 
summarizing all the relevant information in the data. In 
practice, x is constrained to be a meaningful, but easy to 
compute, measure of the data. There are many possible 
choices for the statistic x. This statistic may be based di- 
rectly on observation of the received channel symbols, or it 
may be calculated from the decoded output of the Viterbi 
decoder. This article analyzes several different statistics 
based on channel symbol measurements, and briefly dis- 
cusses some other statistics based on decoded bits; these 
statistics are reported elsewhere [1, 2]. 

Figure 1 is a general system diagram introducing nota- 
tion for describing the two types of synchronization statis- 
tics. The stream of incoming “data” bits {6, } includes both 
true data bits and sync marker bits {A,*}. The “data” bit 
stream is packaged into data frames {6* , i = 1, . . . , B] of 
B bits each, and L sync marker bits {A i = 1, . . . , L} are 
included in every data frame. The “data” bit stream is 
convolutionally encoded by a rate l/N , constraint length 
K convolutional encoder. The encoded channel symbol 
stream i = 1, . . . , 5} is likewise partitioned into frames 
of S = NB symbols each, and each frame includes a set 
of M — N(L — K -f 1) sync marker symbols {m;, i = 
1, . . . , M} that are totally determined by the sync marker 
bits {A ,, L}. The remaining N(B — L + I< — 1) 

symbols in each frame are dependent solely on the true 
data bits or else on a combination of true data bits and 
sync marker bits. 

The channel symbols are assumed to have constant 
magnitude s (i.e., S{ = ±s), and they are received in ad- 
ditive white Gaussian noise {n,,i = 1,...,S} with zero 
mean and variance a 2 . The ratio p = s 2 / a 2 is a signal- 
to-noise parameter. In terms of p, the channel symbol 
signal- to-noise ratio is E s /No = p/2, and the bit-energy- 
to-noise ratio is E^/No = Np/2. The received symbols 
{r;, i = 1 ,...,5} are passed through a maximum likeli- 
hood convolutional decoder (Viterbi decoder) to obtain the 
decoded bits { d{ , i — 1, . . . , 5}. 

The general performance expressions in this article are 
derived for arbitrary combinations of the parameters K , 
N , L , M, B, and 5. However, all explicit performance 
curves assume a 32-bit marker sequence and the NASA- 
standard constraint length 7, rate 1/2 code (i.e., L — 32, 
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M = 52, K = 7, and N = 2). The curves in Section III 
are applicable to any frame length, while the curves in 
Section IV assume a frame length of 10000 bits or 20000 
symbols (i.e., B = 10000, 5 = 20000). 

B. Sync Methods Based on Decoded Bits 

Frame sync observables based on decoded bits are a 
function of the L - vector of discrepancies {A : - 0 d,, i = 
1, . . . , L) between the sync marker bits {A t , i — 1, . . . , L) 
and a sliding L-bit window of actual decoded bits {d;, i = 
Several reasonable statistics are described in 
this section. Further analysis of observables based on de- 
coded bits is not carried out in this article, but perfor- 
mance curves are cited from [1, 2] for comparison with 
channel symbol methods. 

1. Decoded Bit Discrepancy Count Statistic. 
In this case, the statistic x simply counts the number of 
discrepancies between the sync marker bits and the current 
L-bit window: 

L 

x = ^2 © d *i 

iz si 

This is the statistic used in the current DSN frame sync 
algorithm. 

2. Decoded Bit Discrepancy Span Statistic. 
Viterbi decoders tend to make bursts of errors and the 
decoded bits are essentially random inside an error burst 
[3]. Thus, any agreements with the sync marker that oc- 
cur inside a decoder error burst are completely accidental 
and should not be counted as true agreements. This ob- 
servation leads to the definition of the discrepancy span 
statistic. The discrepancy span measures the distance in 
bits between the first and last disagreements with the sync 
marker. The statistic x is defined simply by 

x — i 2 — *i + 1 

where 

ii = min {i : (A; 0 dA = 1} 

1<»<M 1 v 7 J 

*2 = O’ : ( A -' ® d i) = U 

This statistic was first suggested and analyzed in [2]. 


C. Sync Methods Based on Channel Symbols 

Frame sync observables based on channel symbols are 
obtained by comparing a sliding M-symbol window of re- 
ceived symbols {r t *, i = l,...,Af} with the sync marker 
symbols {m;,i = 1 Several reasonable channel 

symbol statistics are described in this section. 

1. Hard-Limited Channel Symbol Discrepancy 
Count. The channel symbol statistic most similar to the 
decoded bit statistics is obtained by hard limiting each re- 
ceived symbol r # - to the nearest transmitted symbol value 
±s and counting disagreements with the sync marker se- 
quence. A statistic x that simply counts the number of 
discrepant symbols is defined by 


x = - ^ [i - S gn (m. ) sgn (r, )] 


This statistic x counts one discrepancy for every disagree- 
ment in sign between a received symbol r, and the corre- 
sponding sync marker symbol m,- . 

Because the received symbols {r,} contain white 
Gaussian noise and are not prone to error bursts, burst- 
inspired statistics such as the discrepancy span statistic 
are not useful in the channel symbol domain. However, 
statistics that make use of the soft-quantized information 
in the channel symbols can offer improvement. 

2. Weighted Symbol Discrepancies. An interest- 
ing statistic using soft- quantized information is derived as 
a weighted sum of symbol discrepancies. The statistic x is 
defined by 

M 

x = y^max [0, -r< sgn (m,)] 

«=i 

Just like the hard-limited symbol discrepancy count statis- 
tic, this statistic x counts 0 every time the received symbol 
ri and the corresponding sync marker symbol m* agree 
in sign, no matter how inexact the agreement might be. 
However, this statistic does not count all sign discrepan- 
cies equally, but instead weights them by the magnitude 
of the corresponding received symbols. 
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The weighted symbol discrepancy statistic was first 
proposed by Massey [4] as a good approximation to the 
maximum likelihood detection statistic in the region of 
high signal-to-noise ratio. This statistic is not further an- 
alyzed in this article, but will be examined in the future 
by simulation or central limit theorem approximation. 

3. Channel Symbol Correlation Statistic. One 
natural soft-quantized statistic measures the correlation 
between the received symbols and the sync marker sym- 
bols. The statistic x is defined by 

M 

X = “ r « ) 

*=1 

The observed value of this statistic should be near 0 if 
{r,-, i = 1,...,A/} contains the marker, and otherwise 
should be a large positive number around Ms 2 . With this 
definition, the correlation statistic exhibits the same qual- 
itative behavior as the discrepancy-based statistics. 

4. Channel Symbol Squared Distance Statis- 
tic. Another natural symbol-based statistic measures the 
squared distance between the received symbols and the 
sync marker symbols. The statistic x is defined by 


a threshold 6 . The tentative decision rule for all statistics 
defined above is of the general form 

sync 

x - e 

> 

no sync 

If x exceeds 0 when x is measured at the true position 
of the marker, the tentative decision rule results in the 
true marker location being missed in the current frame. 
Conversely, if x falls below 0 when x is not measured at 
the true marker position, then the decision rule causes a 
false detection of sync or false alarm. 

The intrinsic goodness of various statistics x can be 
evaluated by comparing the tradeoff between probability 
of miss {Pm) and probability of false alarm (Pp), which 
results as the threshold 0 is varied for a given statistic 
x. The miss probability and false alarm probability are 
defined as 

Pm = Prob [x > 0\ sync marker located at 

current L-bit or M - symbol window] 

Pp — Prob [x < 0\ sync marker not located at 

current L-bit or M - symbol window] 


M 

X = J2( rn i- r if 

» = 1 

Again, a small value of x indicates agreement with the 
marker, while a large value indicates disagreement. 

III. Probability of Miss and Probability 
of False Alarm 

A. Basic Definitions 

All of the statistics x defined in the previous section 
are scalar; a value of x near 0 indicates a match with the 
sync marker, while a large positive value indicates disagree- 
ment. Thus, it is natural to use the observed values of x 
to make tentative yes-no decisions about the location of 
the marker, according to whether x falls below or exceeds 


The miss probability Pm is the probability that the 
true marker will fail to pass the threshold test at the true 
location of the marker. Within any one data frame, there is 
only one opportunity for a miss to occur. The false alarm 
probability Pp is the probability that the threshold test 
will be passed at some particular location that contains 
only true data bits and symbols or possibly a combination 
of true data bits and symbols and marker bits and symbols. 
In each frame there are 5—1 opportunities for false alarm 
for algorithms based on decoded bits, or 5-1 opportunities 
for algorithms based on channel symbols. 

B. Computation of Pm versus Pp for 
Various Observables 

1. Decoded Bit Discrepancy Count Statistic. 
The tradeoff between miss probability and false alarm 
probability for the decoded bit discrepancy count statis- 
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tic has been reported in [1, 2]. A graph showing some of 
the results of [2] is presented in Fig. 2. 

It must be noted that the miss probability and false 
alarm probability defined in [2] are not equal to those used 
in this article. In [2], the threshold test had to be passed 
at the same location in two consecutive frames for either 
a false alarm or detection of true sync to occur. The false 
alarm probability ( 0\M ') and miss probability l—P(0\M) 
defined in [2] are related to the corresponding probabil- 
ities Pf and Pm used in this article as P(0\M ( ) = Pp , 
1 - P(0\M) = 1 — (1 — Pm) 2 - 

2. Decoded Bit Discrepancy Span Statistic. 
The miss probability versus false alarm probability curves 
for the decoded bit discrepancy span statistic are shown in 
Fig. 3. These curves are taken from [2], after adjusting for 
the different definitions of miss probability and false alarm 
probability described above. 

3. Hard-Limited Channel Symbol Discrepancy 
Count Statistic. Measured at the marker, the hard- 
limited symbol discrepancy count statistic x is a binomial 
random variable, 


M i M 

* = £-[l-sgn (m.) sgn (m< + n<)] = 

»=i «=i 


where {x,*,i = are independent binary random 

variables, 

Prob [xi = 1] = q = 1 - Prob [xi = 0] 

with 


q = Prob [n, > m] = Q = Q (s/p) 

and 



The probability distribution for the statistic x is given by 
Prob [x = j] = q> (1 - q) M ~ j , j = 0, . . . , M 

The miss probability is easily evaluated as 

p M = e ftVa-t)*-' 

i=*+i ' 3 ' 

At any given location not overlapping the marker, 
the statistic x is conditionally a sum of two binomial ran- 
dom variables whose statistics depend on the discrepancies 
{<^, i = 1,... , A/} between the marker symbols {m*, i = 
1, . . . , M} and the sliding window of M symbols {$;, i = 
1, . . . , M) at the current location 

Si - - sgn ( mi ) sgn (s,*)] , i = 1, . . . , M 

Then, if w is the Hamming weight of the discrepancy se- 
quence {£*, i=l,..., M }, 

M 

v = £Si 

i=l 

the statistic x can be written as 


M-w j 

x= ^2 2 [ l - s S n ( m «) s « n ( m * + n «)J 

+ o f 1 ~ sgn sgn ( _m *' + "')] 

<=i 1 

M—w w 

= J2 + 

i=l »=1 

where {x*, i = 1, . . . , M — w} are independent binary ran- 
dom variables as defined above and { 2/» , i = 1, . . . , w} are 
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also independent binary random variables with a different 
probability, 

Prob [y t = 1] = p = 1 — Prob [y, = 0] 

The probability of false alarm is obtained by averaging the 
conditional probability that x will fall below threshold over 
the possible discrepancy weights w , 


M 

Pf — ^ Prob[iu] 

u/=0 

M — w w / x r v 

* e e( rV<' 

j—0 k=Q \ J / 

3+k<* 

x (r>‘ (i - pr ‘ 

The discrepancy weight probability distribution 
Prob[u;] was obtained by exhaustive enumeration for a 32- 
bit sync marker sequence and the NASA-standard con- 
straint length 7, rate 1/2 convolutional code. This dis- 
tribution is approximately binomial except at the tails 
(w near 0 or M). The exact distribution for the NASA- 
standard code is reported in Appendix A. 

The tradeoff between miss probability and false alarm 
probability for the hard-limited channel symbol discrep- 
ancy count statistic is plotted in Fig. 4 for signal-to-noise 
ratios Et/No ranging from —1.5 dB to +3.0 dB. 

4. Channel Symbol Correlation Statistic. Mea- 
sured at the marker, the channel symbol correlation statis- 
tic x is a sum of independent Gaussian random variables, 

M M 

x = 

t = l *=1 

where tq is 7V(0, s 2 <r 2 ), x/a 2 is 7^(0, Ms 2 /a 2 ) = N( 0, Mp). 
Thus, the miss probability is calculated simply as 


Pm ~ Q {vMso) 

Away from the marker, the correlation statistic x is 
a sum of conditionally Gaussian random variables, some 
with zero mean and some with nonzero mean: 

M—w w 

x = 52 + B 2 ™. 2 - m,ni ) 

i= 1 t=l 

M—w w 

= 52 u i+52 Vi 

i=l i=l 

where Vi is N(2s 2 , s 2 a 2 ), x/a 2 is N {2ws 2 / <r 2 ) Ms 2 /a 2 ) = 
N{2wp, Mp). 

The false alarm probability is obtained by averaging 
the conditional Gaussian probability distribution for x over 
the discrepancy weight distribution Prob[iu], 

/., = E Prob M <?( M ^) 

Figure 5 shows Pm versus P F for the channel symbol cor- 
relation statistic. 

5. Channel Symbol Squared Distance Statistic. 
At the marker, the squared distance statistic x is a sum of 
squares of M zero-mean Gaussian random variables, i.e., 
x is a chi-squared random variable with M degrees of free- 
dom: 

M 

x = 52 n i 

*=i 

where n t is 7V(0,cr 2 ), x/a 2 is x 2 (^f)- Away from the 
marker, the squared-distance statistic is conditionally a 
noncentral chi-squared random variable, 
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M-xv w 

x = E"‘ + H( 2m * - n «) 2 

»=1 • = 1 

M — tv w 

= £»?+!>? 

»=1 *= 1 

where Zi is AT(2m t -, <r 2 ), x/<t 2 is noncentral x 2 (M\4i*V>). 
The probability distribution functions for central and non- 
central chi-squared random variables with an even number 
of degrees of freedom can be related to the probability dis- 
tributions for certain Poisson random variables, as shown 
in Appendix B. Assuming that the number of marker sym- 
bols M is even, the miss probability and false alarm prob- 
ability can thus be expressed as 


Pm = Prob 


r Af 

E"?>° 


,i = 1 


= Prob 




C. Conclusions from the Pm versus Pp Curves 

Several important conclusions about the relative per- 
formance achievable by the various sync statistics can be 
drawn by comparing the tradeoff curves in Figs. 2 through 
6. A Pm versus Pf curve which is uniformly below and 
to the left of another indicates superiority in the corre- 
sponding sync statistic. For example, Figs. 2 and 3 appear 
to show that the decoded bit discrepancy span statistic is 
uniformly superior to the decoded bit discrepancy count 
statistic, although the advantage disappears at small Pf • 


Among the channel symbol methods, the squared dis- 
tance statistic yields superior performance for large E b /N 0 , 
while the correlation statistic overtakes it for very poor 
signal- to-noise ratio ( E b /N 0 < 0.5 dB). The performance 
of the hard-limited symbol discrepancy count statistic re- 
sembles the correlation statistic’s performance, but is de- 
cidedly inferior at very low signal- to-noise ratio ( Eb/N 0 < 
0 dB). The theoretically near-optimum weighted symbol 
discrepancy statistic may be better than all three, but it 
was not analyzed here. 


M 

Pf = Prob[u>] 

tu=0 


Prob 


M — tv 


E»? + E*?s 


L »=i 


i= 1 


M 

= ^ Prob[u»] Prob 

u>=0 


ki - k 2 (w) > 


M" 

2 


where k x is Poisson (0/2er 2 ), k 2 (w) is Poisson (2 wp), and 
Jbi, k 2 (w) are conditionally independent given w. Evaluat- 
ing the Poisson probabilities gives 


Pm = 



Pf 


]T] ProbH exp(-2 wp) 

z, n 2 * 


o° ( 9 \*i 

*1=^3+ ^2 



The resulting curves of Pm versus Pf are plotted in Fig. 6. 


Comparing the performances of the decoded bit meth- 
ods with those of the channel symbol methods is a more 
complicated issue, because the tradeoff curves are dissim- 
ilar in shape. The curves for the decoded bit statistics 
are much flatter than those for the channel symbol statis- 
tics. There is a distinct crossover point between the curve 
for one of the decoded bit statistics and the corresponding 
curve for one of the channel symbol statistics. To the left 
of the crossover point (small Pf), the decoded bit statis- 
tic is superior, while the channel symbol statistic performs 
better on the other side (large Pf). The judgment of which 
statistic is better depends on the desired operating point 
on the Pm versus Pf curves. As shown in Section IV, this 
desired operating point is a function of the frame length, 
the global acquisition strategy, and the preferred measure 
of acquisition timeliness. 

The flatness of the Pm versus P F curves for the de- 
coded bit statistics is due to the bursty nature of the 
Viterbi decoder errors. A marker-length section of decoded 
bits is likely to be either entirely correct or mostly garbage. 
Thus, only a small improvement in miss probability can be 
obtained by increasing the threshold 6 above 0, while the 
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false alarm probability increases substantially. The con- 
sequence of the flat Pf^ versus Pp curves is that the per- 
formance of the decoded bit schemes is basically limited 
by the Viterbi decoder’s error probability for decoding a 
32-bit symbol. Arbitrarily small false alarm probability 
(down to a minimum of approximately 2~ L , the probabil- 
ity that L data bits will accidentally reproduce the sync 
marker) can be purchased at little expense in Pa/. How- 
ever, reducing Pp too far has no benefit if the frame length 
is small. The acquisition failure probability is totally dom- 
inated by Pm in this case. Even with a large frame, there 
are other ways to reduce Pp dramatically, such as by re- 
quiring frame-to-frame verification before declaring sync. 
Such a technique has virtually no benefit if Pp can already 
be made small without it. 

While the decoded bit curves are very flat, the channel 
symbol curves show a lot of elasticity. Appreciable changes 
in Pp result in appreciable changes in Pm, and vice-versa. 
These methods show stronger response to frame-to-frame 
verification strategies. Of course, the increased elasticity 
also means that all parameters must be more delicately 
adjusted for optimum behavior. 

IV. Probability of Acquisition of True 
Sync 

A. Assumptions About the Acquisition 
Strategy 

The miss probability versus false alarm probability 
curves reflect the power of a given statistic x to discrim- 
inate in a pairwise manner between the true sync marker 
location and any one of the possible false locations. To 
compute the probability that sync will be acquired cor- 
rectly, it is also necessary to consider the global strategy 
that combines all of the bit-by-bit or symbol- by-symbol 
tentative decisions based on x to arrive at a declared sync 
position. 

The global strategy can be divided into two parts. 
First, how are the outcomes of all the tentative decisions 
within one frame period combined to select or rank the can- 
didate marker locations based on a single frame of obser- 
vations? Second, are observations from succeeding frames 
required to reevaluate the candidates from the first frame 
before declaring sync, and, if so, how are they applied? 
These two parts may be called the intraframe strategy and 
the frame-to-frame strategy, respectively. 


In this article, it is assumed that no sync candidates 
emerge from any frame in which the threshold test is failed 
at all locations; when this happens, the sync search is 
restarted from scratch in the next frame. If the thresh- 
old test is passed at one or more locations within a frame, 
it is assumed that the first such location tested is the 
unique sync candidate based on that frame’s observations. 
The current DSN sync algorithm can switch between this 
scheme and another slightly more accurate one that se- 
lects the most probable location (i.e., the one that passes 
the threshold test by the widest margin). The latter in- 
traframe strategy is not considered in this article. 

Two different frame-to-frame strategies are con- 
sidered. The simplest frame-to-frame strategy requires no 
correlation with succeeding frames. Sync is declared or 
not declared solely on the basis of the intraframe results 
in a single frame. A second frame-to-frame strategy is one 
currently used by the DSN, which requires next-frame ver- 
ification before declaration of sync. In this strategy, the 
unique sync candidate (if any) from any single frame is 
chosen as the sync location if and only if it is verified once 
in the next succeeding frame by repassing the threshold 
test at the corresponding location within that frame. 

Each acquisition strategy and sync statistic must be 
evaluated relative to an appropriate measure of perfor- 
mance. The appropriate performance measure is not neces- 
sarily the same for different strategies. For example, mini- 
mum acquisition times under an elaborate frame-to-frame 
verification strategy are guaranteed to be longer than un- 
der a single-frame strategy. In this article, the probability 
of acquisition of true sync in one frame is used to compare 
the performance of different basic observables x under the 
single-frame acquisition strategy. On the other hand, the 
desired measure of performance for the strategy requiring 
next-frame verification is the probability of acquisition of 
true sync within four frames or less. 

B. Probability of Acquisition of True Sync in 
One Frame 

For true sync to be declared after one frame length of 
observations, the true sync location must pass the thresh- 
old test, and no false sync location may pass the threshold 
test earlier. The probability that the true sync location 
passes the threshold test is 1 — Pm . The probability that 
any particular false sync location passes the threshold test 
is Pp. As stated earlier, it is assumed that the same false 
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alarm probability Pp applies both to data-only locations 
and to locations partially overlapping the marker. 

The probability that at least one false sync will be de- 
tected before the true sync location is tested can be upper 
bounded and approximated by multiplying the false alarm 
probability Pp by the total number of false sync locations 
tested prior to the true sync location. This number is a 
random variable that depends on the exact time the syn- 
chronization search was started relative to the true frame 
boundaries. If the synchronization search is started at a 
random time, the average number of false sync locations 
tested before the true sync location is ( B — l)/2 for algo- 
rithms based on decoded bits or ( S — l)/2 for algorithms 
based on channel symbols. 

In summary, true sync can fail to be acquired in one 
frame if either the true sync location fails the threshold 
test, or if a false alarm occurs at any of the false sync loca- 
tions tested earlier. These two events occur with probabil- 
ities Pm and [( B - l)/2 ]Pp or [(5 — l)/2 ]Pp, respectively. 
Thus, the probability of acquisition of true sync within one 
frame is approximated for small Pm and Pp by 


Prob [acquisition of true sync within one frame] 

Q J 

« 1 — Pm — Pf , for algorithms based 

on decoded bits 
5—1 

«1 — Pm x — Pf, for algorithms based 

z 

on channel symbols 

This probability is plotted in Fig. 7 for the five sync ob- 
servables considered in Figs. 2 through 6, assuming a frame 
length of 10000 bits or 20000 symbols and an optimized 
threshold at each signal-to-noise ratio. 

C. Probability of Acquisition of True Sync 
Within Four Frames, With Required 
Next-Frame Verification 

In this section, it is assumed that the frame- to- frame 
strategy requires that the threshold test be passed at the 
same location in two consecutive frames before sync is de- 
clared. For true sync to be declared within four frames, 


the true sync location must pass the threshold test either 
in the first and second frames, or in the second and third 
frames after failing to pass in the first frame, or in the third 
and fourth frames after failing the test in the second frame. 
These three events happen with probabilities (1 - Pm) 2 , 
P M ( 1 - Pm) 2 , and P M ( 1 - Pm) 2 , respectively, assuming 
that observations are independent from frame to frame. In 
addition, it is necessary that no false sync location be de- 
tected and verified before true sync can be declared. Any 
particular false sync location is detected and verified on 
the second frame, third frame, or fourth frame with prob- 
abilities P£, P£( 1 - Pf), or P£( 1 - Pf), respectively. The 
corresponding probabilities that at least one false sync is 
detected and verified on the second frame, third frame, 
or fourth frame can be upper bounded and approximated 
by multiplying the former probabilities by the number of 
possible false sync locations. 

To calculate the probability that a false sync declara- 
tion might preempt the possibility of declaring true sync, 
it is necessary to divide the possible false sync locations 
into two categories. Some false sync locations are first 
subjected to the threshold test prior to the true sync loca- 
tion, and some are first tested after the true sync location. 
The first category of false sync locations can preempt a 
true declaration of sync if both the false declaration and 
the true declaration occur in the same number of frames, 
while the second category causes trouble only if false sync 
is declared in fewer frames than true sync. If the frame 
sync process is started at a random point in the frame, the 
average number of false sync locations in each category is 
(B — l)/2 or (5 - l)/2. 

The dominant contribution to the probability that a 
false sync will be declared before true sync comes from 
the probability that false locations in the first category are 
detected and verified at the first opportunity, i.e., within 
two frames. This probability is upper bounded and ap- 
proximated by [(B — l)/2 \Pp or [(5 — l)/2]Fp. All other 
contributions to the probability of preemptive declaration 
of false sync involve terms of order PmPp and higher-order 
terms, and these terms are unimportant if both Pm and 
Pp are small. 

The previous observations can be summarized as fol- 
lows. True sync can fail to be acquired within four frames 
if the true sync location is not detected and verified within 
four frames or if a false sync location is detected and 
verified earlier. The first event happens with probabil- 
ity 1 - (1 - Pm) 2 ( 1 + 2 Pm) » ZPm , and the second event 
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happens with approximate probability [(B - l)/2 ]Pp or 
[(5 - l)/2]Pp. Thus, the probability of true acquisition 
within four frames is approximated for small Pm and Pp 
by 

Prob [acquisition of true sync within four frames 
with one required verification] 

g l 

« 1 - 3 Pm - — Pp, for algorithms based 

on decoded bits 
5—1 

« 1 — ZPlf Pp, for algorithms based 

2 

on channel symbols 

This probability is plotted in Fig. 8 for the same parame- 
ters considered in Fig. 7. 

V. Summary 

This article has analyzed three different sync statis- 
tics based on measuring channel symbols. The perfor- 


mance of these three statistics was analyzed without sta- 
tistical approximation for the basic tradeoff curves of miss 
probability and false-alarm probability. These basic per- 
formance curves were then extended to yield expressions 
for the probability of timely acquisition under two different 
acquisition strategies. 

It was shown that the statistics based on decoded 
bits are superior to the statistics based on channel sym- 
bols if the desired operating region utilizes a miss prob- 
ability many orders of magnitude higher than the false- 
alarm probability. This operating point is applicable for 
very large frame lengths and minimal frame- to- frame ver- 
ification strategy. On the other hand, the statistics based 
on channel symbols are superior if the desired operating 
point has a miss probability only a few orders of magnitude 
greater than the false-alarm probability. This happens for 
small frames or when frame- to- frame verifications are re- 
quired. Among the three channel symbol methods exam- 
ined, the squared-distance statistic offers the best perfor- 
mance in the range of normal signal-to-noise ratios, but it 
degrades more rapidly than the correlation statistic or the 
hard-limited symbol discrepancy count statistic when the 
signal-to-noise ratio is extremely low. 
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SYNC MARKER BITS {Xj , i = 1 L} ARE TRANSMITTED ONCE EVERY DATA FRAME {bj , I = 1 B} 

WITHIN EVERY SYMBOL FRAME {Sj , i = 1 S} THE SYNC MARKER SYMBOLS {m| , i = 1 M} 

ARE THOSE SYMBOLS THAT ARE COMPLETELY DETERMINED BY THE SYNC MARKER BITS 

S = NB M = N(L - K + 1) 


Fig. 1. System diagram for two sync methods. 



Fig. 2. P M versus P F for discrepancy count statistic 
based on decoded bits. 



Fig. 3. P M versus P F for discrepancy span statistic 
based on decoded bits. 














Fig. 8. Probability of failure to acquire true sync within four 10000-bit frames, 
with required next-frame verification. 
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Appendix A 

52-Symbol Weight Distribution for the 
NASA-Standard (7,1/2) Code 


Evaluation of the false-alarm probabilities for the 
three channel symbol statistics analyzed in Section III re- 
quires knowledge of the probability distribution of the 
Hamming weights of the possible discrepancy sequences 

i = 1, . . . , M } between the sync marker symbols {m*, 
* = 1 and the sliding window of M symbols 

{$i, i = l,...,Af} at the current location. The exact 
weight distribution for a 32-bit sync marker and the 
NASA-standard (7,1/2) code was generated by exhaustive 
enumeration. Since the convolutional code is linear, this 
weight profile is equivalent to the weight profile generated 
by encoding all possible 32-bit patterns. The 2 32 possi- 
ble bit patterns were encoded one by one, and a complete 
histogram of their encoded weights was produced. This 
process required about 14 days of CPU time on a Sun 3 
computer. 

It is shown in [5] and [6] that the weight distribution 
of a binary primitive block code is approximately bino- 


mial. Since the convolutional code is linear, and the 52-bit 
patterns represent truncated sequences from the output of 
the convolutional encoder, these 52-bit patterns can be re- 
garded as the codewords of a (52,32) binary block code. 
Let A w denote the number of patterns of weight w. Then 
according to [5] and [6] A w can be approximated by the 
following binomial coefficient: 


A w « 2 20 = A ' w , 0 < w < 52 


A comparison between the exact weight distribution 
A w and the approximate distribution A' w is given in Ta- 
ble A-l. It is observed that A' w is a good approximation 
to A w except at the two extremes (tu near 0 or 52). 
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Table A-l. Encoded weight profile for all possible 32-bit input 
sequences, encoded by the NASA-standard (7,1/2) code 


Weight, 

w 

Histogram totals 

Ratio, 

Aw/A' w 

Actual, 

Aw 

Binomial, 

A' 

0 

1 

0 

1048576.000 

1 

0 

0 

0.000 

2 

2 

0 

1581.563 

3 

7 

0 

332.128 

4 

18 

0 

69.718 

5 

40 

2 

16.138 

6 

122 

19 

6.284 

7 

333 

128 

2.610 

8 

963 

718 

1.342 

9 

3634 

3509 

1.036 

10 

14244 

15087 

0.944 

11 

54082 

57605 

0.939 

12 

191047 

196819 

0.971 

13 

600241 

605596 

0.991 

14 

1692962 

1687018 

1.004 

15 

4310848 

4273778 

1.009 

16 

9926772 

9883112 

1.004 

17 

20917283 

20928944 

0.999 

18 

40618430 

40695169 

0.998 

19 

72723310 

72822934 

0.999 

20 

120117817 

120157841 

1.000 

21 

183174400 

183097662 

1.000 

22 

258139236 

258001251 

1.001 

23 

336646298 

336523371 

1.000 

24 

406667062 

406632406 

1.000 

25 

455311348 

455428295 

1.000 

26 

472746296 

472944768 

1.000 

27 

455311348 

455428295 

1.000 

28 

406667062 

406632406 

1.000 

29 

336646298 

336523371 

1.000 

30 

258139236 

258001251 

1.001 

31 

183174400 

183097662 

1.000 

32 

120117817 

120157841 

1.000 

33 

72723310 

72822934 

0.999 

34 

40618430 

40695169 

0.998 

35 

20917283 

20928944 

0.999 

36 

9926772 

9883112 

1.004 

37 

4310848 

4273778 

1.009 

38 

1692962 

1687018 

1.004 

39 

600241 

605596 

0.991 

40 

191047 

196819 

0.971 

41 

54082 

57605 

0.939 

42 

14244 

15087 

0.944 

43 

3634 

3509 

1.036 

44 

963 

718 

1.342 

45 

333 

128 

2.610 

46 

122 

19 

6.284 

47 

40 

2 

16.138 

48 

18 

0 

69.718 

49 

7 

0 

332.128 

50 

2 

0 

1581.563 

51 

0 

0 

0.000 

52 

1 

0 

1048576.000 

Total 

2 32 

2 32 
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Appendix B 

Relationship of Noncentral Chi-Squared Probability 
Distributions and Poisson Probability Distributions 


Let £ be a noncentral chi-squared random variable 
with M degrees of freedom and noncentrality parameter 
/i. Then, from [7], the probability distribution function of 
£ can be expanded in the form 


(Af/2)+*-l f • 

Prob({, > 9] = £ 

i= o 


These two results combine to yield 


ProbK > e\ = Prob & > 

k = 0 


where {£*,jfc = 1,2,...} are central chi-squared random 
variables with increasing numbers of degrees of freedom, 




£* is * 2 (Af + 2fc), fc = 1,2, . . . 


= Prob 


j — k < 


2 



Also from [7], the probability distributions for each of the 
central chi-squared random variables can be related to the 
Poisson distribution if M is even, according to 


where j and k are independent Poisson random variables 
with means 9/2 and /i/2, respectively. 
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Performance of the Image Statistics Decoder in 
Conjunction With the Goldstone-VLA Array 

H. C. Wang and G. H. Pitt III 
Communications Systems Research Section 


During Voyager’s Neptune encounter, the National Radio Astronomy Observa- 
tory’s Very Large Array (VLA) will be arrayed with Goldstone antennas to receive 
the transmitted telemetry data from the spacecraft. The telemetry signal from 
the VLA will drop out periodically, resulting in a periodic drop in the received 
signal-to-noise ratio (SNR). The Image Statistics Decoder (ISD), which assumes a 
correlation between pixels , can improve the bit error rate (BER) for images during 
these dropout periods. Simulation results have shown that the ISD, in conjunction 
with the Goldstone— VLA array can provide a 3-dB gain for uncompressed images 

at a BER of 5.0 x 10~ 3 . 


I. Background 

Throughout the Voyager mission, the DSN has been 
used for telemetry reception. As Voyager approaches Nep- 
tune, the signal must travel a long distance to reach earth, 
resulting in a low received SNR. In order to assure that the 
received Voyager data still achieves an acceptable BER, 
the National Radio Astronomy Observatory’s Very Large 
Array (VLA) in the high New Mexico desert can assist 
the DSN station at Goldstone in receiving the Voyager 
telemetry data during Neptune encounter [1]. However, 
VLA telemetry has a dropout period of 1.6 millisec every 
52 millisec. In [2], it was concluded that due to the loss of 
data in the dropout periods, the VLA alone would not be 


able to support the (7,1/2) convolutional code for provid- 
ing an acceptable BER for Voyager telemetry data. 

Arraying the VLA with Goldstone antennas will result 
in a 3-dB gain in SNR, except during the VLA dropout 
period. 


II. BER Improvement Using the ISD 

The Image Statistics Decoder (ISD) has been devel- 
oped for decoding uncompressed images [3]. When de- 
coding uncompressed images, the ISD has superior BER 
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performance over the Maxi mum- likelihood Convolutional 
Decoder (MCD) which is currently used for decoding all 
data from Voyager. The ISD’s algorithm is based on the 
fact that adjacent pixels in an image are very similar in 
their intensities; hence, a pixel is likely to be close to its 
neighbors in intensity. During the VLA’s dropout periods, 
only Goldstone will receive the signal from Voyager. Were 
the SNR to drop to a point where the MCD could not de- 
code the data at an acceptable BER, the ISD may be used 
for decoding the data. In addition, during other situations 
when the received SNR dropped too low for the MCD to 
decode a recognizable image, the ISD could also be used 
as an alternative to the MCD. 

It should be pointed out that due to the large amount 
of computations and comparisons the ISD needs, as well 
as the fact that it currently only exists in software, the 
ISD is much slower than the MCD. The ISD takes three 
days on a SUN 3/260 to decode one image. Table 1 in [2] 
shows the speed of the ISD with different computers. 


III. Simulation 

Simulations have been performed of image reception 
at Goldstone alone and of an equal aperture Goldstone- 
VLA array. In each case, the MCD and the ISD were 
used on identical data. An image of the Uranus moon 
Miranda (Fig. 1) was provided by the Voyager project. 
It was first packed into image frames, which were then 
packed into playback frames. After the playback frames 


were convolutionally encoded (just as the spacecraft would 
do), Gaussian noise was added to the stream of data. 
For the Goldstone-alone case, the noise was stationary; 
for the Goldstone-VLA case, the SNR ratio was dropped 
by 3 dB for 1.6 millisec every 52 millisec to simulate the 
VLA telemetry dropout. These simulated received teleme- 
try streams were then decoded in software, either with an 
MCD or with the ISD inside images, and with MCD out- 
side images. After decoding, the nonimage parts of the 
frame were stripped off, and the resulting image was com- 
pared to the original image to compute bit and byte error 
rates. 

Figure 2 shows the BER for both decoders. The Gold- 
stone stand-alone system has a constant SNR, while the 
Goldstone-VLA array is at its nominal level 97 percent of 
the time and 3 dB lower during the dropout period. 

In the simulated SNR range, the ISD always has a 
better BER than the MCD. In particular, at a bit error 
rate of 5.0 x 10~ 3 , which is the maximum acceptable BER 
for image reception, the ISD has about a 3-dB gain. Note 
the steeper slope of the ISD curves at low SNRs. A con- 
jecture for this phenomenon is a higher performance gain 
from a few good pixels among a large number of bad pix- 
els at low SNRs. This is in contrast to the high SNR case 
where most pixels are good, hence yielding a lower per- 
formance gain with the addition of the same number of 
pixels. Figure 2 also shows that both the MCD and the 
ISD for the Goldstone-VLA array lose about 0.2 dB when 
compared to Goldstone alone at the same nominal SNR. 
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A Closed-Cycle Refrigerator for Cooling 
Maser Amplifiers Below 4 Kelvin 

M. Britcliffe 

Radio Frequency and Microwave Subsystems Section 


A helium refrigerator utilizing the Gifford-McMahon/Joule-Thomson (GM/JT) 
cycle was designed and tested to demonstrate the feasibility of using small closed- 
cycle refrigerators as an alternative to batch-filled cryostats for operating temper- 
atures below 4 K. The systems could be used to cool low-noise microwave maser 
amplifiers located in large parabolic antennas. These antennas tilt vertically f mak- 
ing conventional liquid- filled dewars difficult to use. The system could also be used 
for a non-tilting beam waveguide antenna to reduce the helium consumption of a 
liquid helium cryostat. The prototype system is adjustable to provide 700 mW of 
cooling at 2.5 K to 3 W at 4.3 K. Performance of the unit is not significantly af- 
fected by physical orientation. The volume occupied by the refrigerator is less than 
0.1 m 3 . Two JT expansion stages are used to maximize cooling capacity per unit 
mass flow. The heat exchangers were designed to produce minimum pressure drop 
in the return gas stream. Pressure drop for the entire JT return circuit is less than 
5 kpa at a mass flow of 0.06 g/sec when operating at 2.5 K. 


I. Introduction 

The DSN currently uses low-noise maser amplifiers 
operating in 4.5-K closed-cycle refrigerators on 70- and 34- 
m Cassegrainian antennas. Planetary spacecraft programs 
planned for the 1990s would greatly benefit from improved 
downlink performance from these antennas. One strategy 
for improving downlink performance is to further lower the 
noise temperature of the antenna systems. Two methods 


for reducing the system noise temperature are lowering the 
physical temperature of the maser well below the presently 
used 4.5 K, and cryogenically cooling the microwave com- 
ponents in the feed system between the feed horn and 
the maser. 

New 34-m beam waveguide antennas are being im- 
plemented in the DSN that provide a large non-tipping 
location for the maser and feed components. For these 
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antennas, a batch-filled liquid helium cryostat provides a 
reliable method to accomplish both the above improve- 
ments. However, for the existing 70-m and 34-m antennas 
with feedc one- mounted masers, a new type of closed-cycle 
refrigerator that will operate at lower temperatures in a 
tilting environment is required. A similar closed-cycle re- 
frigerator could also be used in conjunction with a liquid 
helium cryostat in the beam waveguide antennas to recon- 
dense the helium boil-off from the dewar and eliminate the 
need for manually refilling the system with liquid helium. 
This article describes the initial development of a refriger- 
ator configuration that could satisfy these requirements. 

One of the simplest approaches to providing closed- 
cycle cooling at temperatures below 4 K is to operate a 
conventional Joule-Thomson/Gifford- McMahon (JT/GM) 
cooler at a subatmospheric JT return pressure by pumping 
the gas returning from the JT circuit with a vacuum pump. 
The final temperature achieved using this configuration 
is a function of the pressure drop in the return path of 
the JT circuit heat exchangers and the capacity of the 
vacuum pump. The cooling capacity available at the final 
stage is limited by the JT mass flow, which is a function of 
the capacity of the GM expander and the efficiency of the 
heat exchangers. Unfortunately, the pressure drop in the 
heat exchangers also increases as the square of the mass 
flow, which limits the ultimate cooling capacity for a given 
operating temperature. 

The refrigerator described in this article was designed 
specifically for operation at subatmospheric JT return 
pressures. A two-stage JT expansion was used to max- 
imize the available cooling capacity per unit mass flow. 
The heat exchangers were designed to provide low pressure 
drop in the return path and to take maximum advantage 
of the GM engine capacity at each heat exchanger stage. 
Adjustable expansion valves are used to allow the cooling 
capacity and temperature to be varied from 700 mW at 
2.5 K to 3 W at 4.4 K. 


II. Joule-Thomson Circuit Capacity 

The refrigeration capacity of the final stage of a con- 
ventional Joule-Thomson refrigerator can be approxi- 
mated by doing an enthalpy balance analysis of a system 
consisting of the final heat exchanger, the JT expansion 
valve, and the final heat station (see Fig. 1). If the capac- 
ity of the refrigerator is considered to be the constant tem- 
perature capacity (i.e., the the maximum heat load that 
can be applied without warming the refrigerator above the 
saturation temperature), the fluid at the outlet of the heat 


station can be considered to be saturated vapor. In this 
case the refrigeration capacity is equal to the heat of va- 
porization of the liquid fraction of the flow through the JT 
valve. The constant temperature capacity is given by the 
following: 

q = h 3 - h 2 or q = h 8at - h 2 (1) 


where h sat is the enthalpy of saturated vapor at the final 
heat exchanger return side inlet and h 2 is the enthalpy 
of the fluid exiting the JT valve. If the JT expansion is 
isenthalpic then h 2 is equal to the value of the enthalpy 
leaving the cold side of the heat exchanger. Then, h 2 can 
be expressed as follows: 

h 2 — </he — hj (/i 4 h sa t) (2) 


Where h x is the enthalpy of the fluid at the inlet of the 
heat exchanger, q^ e is the heat transferred from the warm 
stream to the cold stream in the exchanger, and h 4 is the 
enthalpy at the outlet of the cold return of the heat ex- 
changer. Combining Eqs. (1) and (2) gives the following: 

q — h 4 — hi (3) 


Heat exchanger efficiency is defined by the following: 
e = -^- = h out --^--h itt (4) 

9max ™out' 

where % e is the actual heat exchange between the warm 
and cold streams in the heat exchanger and < 7 max is the 
maximum possible heat transfer with no thermal resis- 
tance between the warm and cold stream. If the heat 
capacities of the supply stream and the return stream are 
different, the heat exchanger is “pinched.” This means 
that the heat transfer between the two streams is limited 
to the smaller of the two enthalpy changes. The primed 
enthalpy value indicates the fluid properties that would be 
achieved if the thermal resistance in the heat exchanger is 
zero. 

In the case of operation near 4 K (100-kpa return 
pressure) a JT supply pressure can be chosen (1.6 Mpa) 
so that the heat exchanger pinch that occurs in any of the 
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stages is small. Equation (4) can be written in terms of 
the enthalpy change as follows: 


/i 4 = e h4 t 


( 5 ) 


Substituting this expression into Eq. (3) yields the follow- 
ing: 


q = e h 4 > - h\ 


( 6 ) 


This shows that the final capacity of the Joule-Thomson 
stage is a function of the enthalpy of the fluid entering the 
final heat exchanger and the heat exchanger efficiency. 

At temperatures below 4 K there is no single oper- 
ating point for the JT supply pressure that will prevent 
an imbalance of the heat capacity between the supply and 
return gas in either the final heat exchanger or the second- 
stage heat exchanger. If the enthalpy change (h 4 — h s at ) of 
the final stage return stream is smaller than the enthalpy 
change (/i 2 > — ^i) in the supply stream, the temperature 
of the fluid entering the expansion valve will be higher. 
This will result in a smaller fraction of the JT flow be- 
ing liquified and therefore reduce the capacity. A pinch 
in the second-stage heat exchanger upstream of the GM 
second stage will increase the loading on the GM stage 
and increase the temperature of the gas entering the final 
stage. 

Collins [1] suggested using multiple stages of JT ex- 
pansion to optimize the heat transfer in the JT heat ex- 
changers as a method to avoid this problem. The heat 
capacity in the warm and cold streams in the heat ex- 
changers can be balanced more closely by expanding the 
fluid in several steps along the final heat exchanger. This 
process results in the enthalpy of the fluid being lowered 
to the minimum level possible before the final expansion 
valve. This causes the largest possible fraction of the flow 
through the final expansion to be liquefied and yields the 
maximum final stage cooling capacity. Figure 2 shows 
the schematic of a refrigerator using two stages of JT ex- 
pansion. Figure 3 represents the cycle on a temperature- 
entropy diagram. The dotted line indicates the path for a 
single JT expansion. The area under the constant temper- 
ature line at 2.5 K represents the cooling capacity of the 
refrigerator. The shaded area between the final expansion 
end points represents the cooling capacity gained by using 
two expansions. 


III. Heat-Exchanger Return-Path 
Pressure Drop 

Return-path pressure drop is a critical parameter in 
the design of JT refrigerators because it determines the 
pressure and therefore the temperature of the helium bath 
in the final stage. Designing heat exchangers for these 
refrigerators requires a compromise between heat exchange 
and pressure drop. Factors that improve heat exchange 
(small cross-sectional area and long path lengths) tend to 
increase pressure drop. 

The pressure drop for heat exchangers with flow 
across tubes is given by Hogan [2] to be the following: 


A P = 


L M 2 
f D e A 2 p 


( 7 ) 


where / is the friction factor, which is a function of the 
Reynolds number that involves the tube size and the fluid 
properties in the heat exchanger; L is the heat exchanger 
length, and D e is the hydraulic diameter of the flow pas- 
sage; M is the JT mass flow rate, A is the passage cross- 
sectional area, and p is the fluid density. The mass flow 
squared dependence of pressure drop shows the need to 
keep the JT flow as low as possible. Table 1 shows the 
calculated pressure drop for the three heat exchangers in 
the JT circuit for a nominal mass flow rate of 0.08 g/sec. 
The pressure drop in the first-stage exchanger is the largest 
contributor to pressure drop by a factor of 10. 


IV- Hardware Description 

A prototype closed-cycle refrigerator was built to 
study the feasibility of using the multiple JT valve con- 
cept to obtain temperatures of 2 K and below. The device 
features a two-stage JT expansion with adjustable expan- 
sion valves to optimize cooling capacity as a function of 
mass flow. The heat exchangers were designed to provide 
minimum pressure drop and still provide adequate heat 
transfer. Many of the components used in the fabrication 
of the prototype such as the final cold stage and the ther- 
mal switch used to aid in the cooldown of the system were 
adopted from existing JPL designs and are described by 
Higa and Weibe [3]. 

The entire refrigerator is less than 70 cm long and 
20 cm wide and occupies a volume of less than 0.1 m 3 . 
The JT circuit is contained in a cylindrical vacuum housing 
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that is 20 cm in diameter and 40 cm long. The complete 
assembly, shown in Fig. 4, weighs 20 kg. 


A. Gifford-McMahon Expander 

The refrigerator uses a Varian two-stage GM expander 
designed for cryo-pump applications. The device was de- 
signed to provide cooling at 20 K and 80 K with nominal 
cooling capacities of 5 W and 35 W respectively. The 
measured cooling capacity of the device as a function of 
temperature using a JPL 5-hp compressor is plotted in 
Figs. 5 and 6. The JT circuit heat sinks were bolted to 
the displacer cylinder flanges rather than soldered directly 
to the cylinder. 


B. Heat Exchangers 

The heat exchangers were adapted from those devel- 
oped for the JPL prototype 2-W, 4-K refrigerator [4, 5]. 
They consist of a helix of corrugated bronze tubing wound 
on a Micarta mandrel. This assembly is housed in a tightly 
fitting stainless steel outer tube. The high pressure fluid 
travels through the inside diameter of the bronze tubing 
and the low pressure stream returns over the outside of 
the tube in the annulus between the inner mandrel and 
the outer tube in a cross-flow arrangement. The sizes and 
measured efficiencies of the heat exchangers are tabulated 
in Table 2. 


C. Compressor and Vacuum Pump 

A JPL 5-hp two-stage compressor was combined with 
a 21-1/sec direct-drive vacuum pump (Leybold-Heraeus 
S 65 BC) to provide the proper operating pressures for 
the refrigerator. A schematic diagram of the system is 
shown in Fig. 7. The helium compressor will supply the 
GM expander with 2.0-Mpa helium and maintain a 700- 
kpa return pressure. The JT circuit is supplied with 1.0- 
to 2.0-Mpa gas. Gas returning from the JT circuit is evac- 
uated by the vacuum pump and then returned to the com- 
pressor at 110 kpa. An oil separator returns any vacuum 


pump oil vapor to the pump. The pressure at the vacuum 
pump inlet can be maintained below 50 torr for any flow 
rate below 0.25 g/sec, which is the highest practical flow 
rate for the refrigerator. 


V. Performance 

The adjustable expansion valves allowed comparisons 
of the refrigerator’s performance using single and double 
JT expansions. Final-stage heat capacity is plotted as a 
function of temperature for both cases in Fig. 8. Using 
a two-stage expansion results in a 30-percent increase in 
cooling capacity at 2.5 K. These measurements were taken 
with a fixed mass flow of 0.06 g/sec. Supply pressure for 
the single-stage measurement was 1.6 Mpa. For the two- 
stage measurement, the high pressure was 1.6 Mpa for the 
first stage and 600 kpa for the second stage. 

Tests performed with an additional heat load applied 
to the first GM engine stage show that the efficiency of the 
first stage heat exchanger could be reduced by 30 percent 
and still allow normal JT stage operation. Utilizing a less 
efficient but lower pressure drop perforated-plate-type heat 
exchanger in the first stage should allow the refrigerator 
to operate below 2 K. 


VI. Conclusions 

Tests of the prototype refrigerator confirm the feasi- 
bility of using a small GM/JT refrigerator for tempera- 
tures of 2.5 K and below. Two of the major factors in- 
volved with designing refrigerators for this application are 
minimizing the JT circuit return-path pressure drop and 
optimizing the refrigeration capacity per unit mass flow. 
Ninety percent of the pressure drop in the JT return path 
occurs in the first-stage (300-40 K) heat exchanger. This 
pressure drop can be lowered by using a less efficient plate- 
fin type device and allowing the first stage of the GM ex- 
pander to absorb the excess heat. The heat capacity per 
unit mass flow can be improved by using a two-stage JT 
expansion. 
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Table 1. Calculated heat exchanger pressure drop 


Heat exchanger 

Temperature, K 

Pressure drop, Kpa 

1 stage 

175 

3.9 

2 stage 

20 

0.24 

3 stage 

7 

0.06 


Table 2. Prototype heat exchanger efficiencies and sizes 


Heat exchanger 

Efficiency, percent 

Length, cm 

Diameter, cm 

1 stage 

90 

25 

3.75 

2 stage 

97 

20 

3.10 

3 stage 

98 

35 

3.10 
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Fig. 1. Single-stage JT schematic 
diagram. 
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Fig. 2. Two-stage JT circuit schematic 
diagram. 
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Fig. 5. Varian GM refrigerator first-stage capacity. 
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Fig. 4. Prototype refrigerator. 
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Fig. 6. Varian GM refrigerator second-stage capacity. 
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Fig. 8. Final-stage cooling capacity as a function of 
temperature. 
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Wideband Phase-Locked Angular Modulator 
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A phase-locked loop (PLL) angular modulator scheme has been proposed which 
has the characteristics of wideband modulation frequency response . The modulator 
design is independent of the PLL closed-loop transfer function H(s), thereby allow- 
ing independent optimization of the loop f s parameters as well as the modulator J s 
parameters. A phase modulator implementing the proposed scheme was built to 
phase modulate a low-noise phase-locked signal source at the output frequency of 
2290 MHz. The measurement results validated the analysis by demonstrating that 
the resulting baseband modulation bandwidth exceeded that of the phase-locked 
loop by over an order of magnitude. However, it is expected to be able to achieve 
much wider response still. 


I. Introduction 

Phase-locked loops (PLL) are sometimes designed to 
mechanize the generation of phase modulation (PM) or 
frequency modulation (FM) signals. In a synthesized sig- 
nal generator, the same frequency synthesis loops are often 
also used for providing PM and/or FM capability. As such 
an angular modulator, the PLL has the following desirable 
characteristics: the modulated carrier frequency can be 
stabilized by a stable reference signal; and a large modu- 
lation index can be obtained by suitable phase compression 
in the loop [1]. In a typical phase- locked angular modula- 
tor, shown in Fig. 1, the modulating signal is added to the 
appropriate baseband points in the loop to produce the 
desired PM or FM [2]. An analysis of this scheme shows 
that the modulation transfer function relating the output 
phase to the input phase modulating voltage is propor- 


tional to the PLL closed-loop transfer function H(s) (a 
low-pass function), whereas the function relating the out- 
put instantaneous frequency to the input frequency modu- 
lating voltage is proportional to the complementary high- 
pass of H(s). This dependence of the modulation transfer 
function on the PLL closed-loop characteristics has the 
following consequences: 

(1) The configuration of Fig. 1 does not lend itself to 
easy implementation of both PM and FM using 
the same loop. For PM, a sufficiently wide loop 
bandwidth is required to accommodate the high 
frequency modulating signal, whereas for FM a 
narrowband loop is needed to prevent the loop 
from tracking out the low frequency modulating 
signal. 
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(2) Wideband modulation frequency response is of- 
ten not possible since the design is usually con- 
strained by considerations such as spurious signal 
rejection, loop stability, settling time, and noise 
performance that place upper and lower limits on 
the loop bandwidth. 

(3) The loop introduces phase distortion to the out- 
put modulating envelope because of its band- 
limited nature. 

A phase-locked modulator scheme is proposed that 
eliminates the above limitations. It will be shown that the 
modulation transfer function resulting from this scheme, 
which has the characteristics of wideband frequency re- 
sponse, is independent of the PLL transfer function. An 
experimental phase modulator was built. The measured 
data substantiate the analysis by demonstrating that the 
resulting baseband modulation bandwidth exceeded that 
of the PLL by over an order of magnitude. 


Simplifying yields 


0o(s) 


KoKpFjs) 

s + K 0 K d F(s) 


Qr(s) 


+ 


KpPjs) 

s + K 0 K d F{s) 


M«) 


+ 
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( 2 ) 


The quantity KpKpFjs)/ [s + Koitr/}i r (s)] is the closed- 
loop transfer function, denoted Hjs). Thus, 


0 o (s) = H(s)Q r (s) + 


KpPjs) 

s + K 0 K d F(s) 


V m (s) 


+ 


K 0 F(s)Q(s) 
s + K 0 K d F{s) 


V M (s) 


(3) 


II. Analysis 

Figure 2 shows the generalized block diagram of the 
proposed modulator scheme. A noise-free and linearized 
model of a PLL consisting of a phase detector, loop filter, 
and a voltage-controlled oscillator (VCO) is assumed. The 
following notation will be used: 


©fl(s) 

0o(s) 

0 m(s) 

Qm(s) 

K d 

K 0 

Fjs) 

Pjs), Qjs) 

V M js) 


reference phase, rad 

output phase, rad 

output phase modulation, rad 

output frequency modulation, 
rad/sec 

phase detector gain, volt/rad 
VCOs gain constant, rad/sec/volt 
loop filter 
modulation filters 
input modulating signal, volt 


0ii(s) defines the reference phase for the loop to maintain 
lock and is constant as far as the modulation is concerned. 
So, the output phase modulation due to the input modu- 
lating signal is 


6m(s) 


Ko P(s) 


8 + KoK D F(8) 

Kq F(s)Q(s) 
s + K 0 K d F(s) 


V m {s) 


Vm(s) 


(4) 


which can be rearranged to yield 


©Af(s) 


± Kq Kg Vm(s) 

s + K 0 K d F(s) s W K d 

KoKpFjs) 0 (s) Vm(s) 
s + K 0 K d F(s) WK ’ K d 


(5) 


The output phase for the loop in Fig. 2 is given by 
0o(s) = ^|p(*)V^(s) + F(s)[g(s)V^(s) 

+ifi>(eji(*)-©oW)]} (1) 


or 

= { [l - H(s)] K^Pjs) + Hjs)Q(s ) | 

"(6) 

The output instantaneous frequency is the derivative 
of the phase. Hence the output frequency modulation is 
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^m(s) = sQm(s) 


= I [i - #(*)] P(s) + ^(5)^-g( s )|K 0 K A/ ( 5 ) 

(7) 

It is evident from Eqs. (6) and (7) that the design 
strategy is to choose the correct modulation filters P(s) 
and Q(s) that would enable the appropriate compensa- 
tions to eliminate the terms involving H (s) and thus allow 
the modulation response to be independent of the PLL 
transfer function parameters. That is, it is desirable to 
choose P(s) and Q(s) such that the sums inside the brack- 
ets of Eqs. (6) and (7) simplify to unity. The two cases of 
PM and FM are considered. 


A. Phase Modulation 

Choose P(s) = s/KoKp. This is the transfer function 
of a differentiator where the time constant equals l/K 0 K D , 
and choose Q(s) = 1. Then Eq. (6) becomes 

e*(.) - {[i - *(.» + *(.)}^ (8 ) 

or 

0jl/(s) = y^ V m{s) (9) 

The output phase is thus proportional to the input 
modulating signal, scaled by the constant 1 /Kr>. This is 
the desired result. 

B. Frequency Modulation 

Choose P(s) = 1, and choose Q(s) = K 0 Kp/s. This 
is the transfer function of an integrator with the time con- 
stant equal to 1/KqKd. Then, Eq. (7) becomes 

Q M (s) = {[1 - H(s)] + ms) K S KD -°-f R }^oV M (s) 

( 10 ) 

or 

n M (s) = KoVm(s) (11) 

The output instantaneous frequency is thus propor- 
tional to the input modulating signal, scaled by the con- 
stant Kq. This is the desired result. 


III. Discussion 

Equations (9) and (11) demonstrate that the output 
PM and FM frequency responses are independent of the 
PLL transfer function H(s). Specifically, the detail of the 
loop filter F(s) is completely irrelevant as far as the over- 
all modulation process is concerned. To the extent that 
the linearized model of the PLL is valid, and to the de- 
gree that P(s) and Q(s) can be realized, the design of the 
modulator can be completely independent from the PLL 
closed-loop characteristics; the important parameters, as 
shown by the analysis, are the phase detector and the VCO 
gain constants. This independence of the modulation fre- 
quency response from the transfer function H(s) is ex- 
tremely desirable. Since Eqs. (9) and (11) are implied, the 
proposed modulator scheme is a wideband configuration 
and therefore does not suffer from the inherent limitations 
associated with the configuration of Fig. 1 as mentioned 
above. 

It is possible to obtain wideband response in the con- 
figuration of Fig. 1 by introducing the appropriate com- 
pensation networks to the input modulating signal prior 
to the PLL modulation terminals and external to the PLL. 
These networks would be in the form of l/H(s) or 1/[1 — 
J7(s)] for PM or FM, respectively. The drawback is that 
H(s) must be accurately known. Also, if a high-order loop 
is required, the resulting high-order compensation network 
could be difficult to design. Furthermore, the loop would 
invariably have spurious poles contributed by the loop fil- 
ter network. This would make extremely difficult the task 
of fully characterizing H(s) to the degree required for wide- 
band compensation. 

Based on the results of Eqs. (9) and (11) it is pos- 
sible to propose an entirely new modulator configuration 
which is shown in Fig. 2. This scheme takes advantage 
of the feedback loop and applies the modulation to both 
baseband positions before and after the loop filter such 
that the modulator design is free from the effect and con- 
straint of F(s). The compensation networks are simply 
the differentiator and the integrator whose time constants 
are solely determined by the product Ko Kd* They re- 
main unchanged regardless of the specific detail of F(s ), 
and hence the loop order. In particular, as far as the mod- 
ulator design is concerned, any spurious poles of F(s) or 
of any networks in the signal path from the phase detector 
output to the VCO input are lumped together with F(s) 
as part of an equivalent loop filter. This also applies to 
any time delay functions in that path. However, as will be 
shown in the experimental results discussed in Section IV, 
the design is not free of the effect of spurious poles in the 
loop that are not made part of the loop filter. 
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This scheme is ideal since H(s) is self-negating to 
unity as shown by Eqs. (8) and (10) and obviates the 
necessity for an accurate external loop compensation to 
H (s) or the need to compromise between the PLL and the 
modulator performance. 

Figures 3 and 4 show block diagrams of the proposed 
phase modulator and frequency modulator. It is evident 
from the diagrams that the frequency modulator can be de- 
rived from the phase modulator by inserting one integrator 
in each of the phase- modulating signal-injection arms (one 
of which nullifies the differentiator). 

The linearity requirement constrains the peak modu- 
lation index seen at the phase detector, hereafter denoted 
/? p . This means that the sum of the peak phase error in the 
absence of modulation and (3 p must be within the linear 
range of the phase detector. For a sinusoidal phase detec- 
tor transfer function, this sum should not exceed 30 deg 
for good linearity. Also, since the PLL is a carrier tracking 
loop, the design must prevent the range of modulation in- 
dex from causing a carrier-null at the phase detector input. 
The constraint on /? p can be mitigated by judicious choice 
of the phase compression factor N in the feedback loop 
that would permit a broader range of modulation index at 
the loop output. 

Linearity of the integrator and the differentiator also 
constrains f3 p . In the PM case, it can be shown that /? p 
must satisfy (3 p < KqVsat /(27r/ max ), where Vsat is the 
saturation voltage of the differentiator and f max is the 
highest input modulating frequency. Similarly in the FM 
case, 0 P must satisfy 0 P < Vsat/Kd , where Vsat is the 
saturation voltage of the integrator (here (3 P is the ratio 
of the maximum frequency deviation and the lowest input 
modulating frequency). 

It would appear from Eq. (11) for the FM case that 
it would be possible to apply modulation at very low fre- 
quency all the way down to DC. Then, one could generate 
a constant frequency shift at the VCO output by applying 
a constant DC voltage at the modulating input. However, 
a DC response is not possible in reality because the inte- 
grator (in Fig. 4) would eventually saturate. Secondly, the 
phase error would eventually exceed the dynamic range 
of the phase detector and the loop would then no longer 
remain linear or in lock. However, by careful design to pre- 
vent component saturation and to remain within the linear 
range of the PLL, it is possible to modulate the frequency 
as close to DC as desired. 


IV. Experiments 

A phase modulator implementing the proposed 
scheme was built to verify the analysis. It was designed 
to phase modulate a phase-locked signal source whose de- 
sign emphasis is for phase-noise and spurious signal power 
spectral density performance. A block diagram of the cir- 
cuit is shown in Fig. 5. Basically, the loop is a second- 
order PLL and is of the mix-down-and-divide design for 
low-noise performance. The VCO is a cavity oscillator 
operating at a frequency of 2290 MHz. The x24 mul- 
tiplies the 95-MHz phase-locked crystal oscillator output 
frequency to 2280 MHz. The 2280-MHz signal downcon- 
verts the cavity oscillator output at 2290 MHz to an IF 
of 10 MHz. The -r2 frequency divides the 10-MHz IF to 
5 MHz and effects a phase compression by a factor of 2. 
A temperature-stabilized low-noise crystal oscillator pro- 
vides the 5-MHz reference signal. From the phase-noise 
data of the 95-MHz phase-locked crystal oscillator and the 
free-running cavity oscillator, the loop parameters were 
designed for optimum phase noise performance at the out- 
put, with the constraint of adequate rejection of the 5-MHz 
reference signal feed-through. The loop natural frequency 
was selected to be about 35 kHz and the damping ratio to 
be 0.707. Measured gain constants of the sinusoid phase 
detector and the VCO are 0.30 volt/rad and 300 kHz/ volt, 
respectively, which yields the time constant 1/Ko Kd of 
1.77 /isec. An op-amp differentiator with a 3-dB band- 
width of about 14 MHz was used. A gain adjustment was 
included as part of the differentiator to effectively provide 
the adjustment of the time constant. 

To test the circuit, a baseband sinusoidal signal from 
a signal generator was applied to phase modulate the 2290- 
MHz carrier output. To assure linearity at the phase detec- 
tor and not to exceed the linear range of the differentiator 
at high modulation frequency, the modulation index was 
kept small, at about 3 deg. The output was then observed 
on a spectrum analyzer while sweeping the input modulat- 
ing signal frequency. The spectrum analyzer thus displays 
the magnitude of the modulation transfer function being 
measured at the carrier frequency. 

Figure 6 shows the result when the modulating sig- 
nal is applied to the loop filter input only. As would be 
expected, the output modulation spectrum is symmetri- 
cal about the carrier frequency. Note that in this case 
the display spectrum envelope is proportional to \H(jw)\. 
It can be seen that the actual natural frequency is about 
40 kHz, and the damping ratio is about 0.50. The 3-dB 
bandwidth of the loop is about 75 kHz. Figure 7 shows 
the result when the modulating signal is applied to the 
differentiator input only; here the spectrum envelope is 
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proportional to 1 1 — H(jw)\. These two cases correspond 
to the configuration in Fig. 1, and the results in Figs. 6 
and 7 demonstrate the band-limited nature of that circuit. 

Figure 8 shows the spectrum analyzer display when 
the modulating signal is applied to both inputs of the 
loop filter and the differentiator. Note that the previous 
high- and low-frequency roll-offs in Figs. 6 and 7 as well 
as the amplitude peakings occurring at the loop’s natural 
frequency have disappeared. The flat spectrum indicates 
that wideband frequency response has been achieved as 
predicted by the analysis. Figure 9 shows the same results 
but with the spectrum analyzer set at a 10-MHz span. As 
can be seen by comparing Figs. 6 and 8 (or 9), the phase 
modulation transfer function bandwidth has increased by 
more than an order of magnitude over that of the PLL. 
Specifically, the baseband modulation 3-dB bandwidth as 
shown in Fig. 9 is about 1.50 MHz in comparison with 
the 3-dB bandwidth of only 75 kHz for the loop as shown 
in Fig. 6. The rolling-off of the output modulation spec- 
trum in Fig. 9 (instead of its remaining flat as it should) 
was puzzling at first. It was later realized that this is 
because the driving circuit of the VCO varactor diode as 
seen from the output of the differentiator network approx- 
imates an RC lowpass function with a 3-dB bandwidth of 


about 1.50 MHz. This forms a spurious pole as discussed 
in Section III. Suffice it to say that this part of the cir- 
cuit must be designed properly or be lumped as part of 
the loop filter network in order to improve high frequency 
modulation performance. 


V. Conclusion 

A phase-locked angular modulator scheme has been 
proposed which has the characteristics of wideband mod- 
ulation frequency response. The modulator design is in- 
dependent of the PLL closed-loop transfer function H(s), 
thereby allowing independent optimization of the PLL pa- 
rameters as well as the modulator’s parameters. A phase 
modulator implementing the proposed scheme was built to 
phase modulate a phase-locked signal source that was de- 
signed for low-noise performance at 2290 MHz. The mea- 
surement results validated the analysis by demonstrating 
that the resulting baseband modulation 3-dB bandwidth 
exceeded that of the PLL by over an order of magnitude. 
With an additional implementation effort and measure- 
ments to improve the high frequency performance, it is 
expected that flat wideband modulation response from DC 
well into the MHz regions can be achieved. 
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Fig. 1. Typical phase-locked modulator [2]. 



Fig. 4. Proposed wideband frequency modulator. 



Fig. 2. Proposed phase-locked modulator. 
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Fig. 5. Experimental wideband phase modulator. 


Fig. 3. Proposed wideband phase modulator. 
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CENTER 2.289 997 GHz SPAN 1.00 MHz CENTER 2.289 997 GHz SPAN 1.00 MHz 
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Fig. 6. Output phase modulation spectrum without Fig. 8. Output phase modulation spectrum of the 

compensating differentiator. proposed wideband scheme. 
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Fig. 7. Output phase modulation spectrum with Fig. 9. Output phase modulation spectrum of the 

compensating differentiator only. proposed wideband scheme at 10- MHz span. 
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The Split-Symbol Moments Estimator (SSME) is an algorithm that is designed 
to estimate symbol signal-to-noise ratio (SNR) in the presence of additive white 
Gaussian noise (AWGN). This article examines the performance of the SSME al- 
gorithm in band-limited channels and quantifies the effects of the resulting inter- 
symbol interference (ISI). All resuits obtained herein are in closed form and can 
be easily evaluated numerically for performance prediction purposes. Furthermore, 
they are validated through digital simulations. 


I. Introduction 

The Deep Space Network Baseband Assembly (BBA) 
and Advanced Receiver (ARX) use a Split-Symbol Mo- 
ments Estimator (SSME) algorithm to estimate the sym- 
bol signal-to-noise ratio (SNR) of the received signal. The 
algorithm was designed to operate in the presence of ad- 
ditive white Gaussian noise (AWGN) and at data rates 
low enough so that bandwidth limiting is insignificant. Its 
principle of operation relies on using the outputs of two ac- 
cumulators each operating on a separate half of the same 
symbol. The product of the outputs provides, after av- 


eraging, an estimate of the signal power. An estimate of 
total power is obtained by summing both accumulations, 
thus integrating over the full symbol period, squaring the 
sum, and then averaging to obtain an estimate of signal 
mean-square plus noise variance. By processing both these 
averages, an estimate of the symbol SNR can be easily 
derived. 

This article examines the performance of the SSME 
when the data stream is filtered and inter-symbol interfer- 
ence (ISI) results. This is a likely scenario in the ARX II 
(current version of ARX) which has a 20-MHz processing 
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rate and a data rate goal of 6.6 Msymbols/sec. The ratio of 
the bandwidth W to the highest data rate ( R *) expected is 
W/Rh = 1.33 assuming an 8.8-MHz 3-dB telemetry band- 
width. Analysis and simulations are used simultaneously 
to quantify the performance of the SSME in band-limited 
channels. Throughout this article, perfect carrier, subcar- 
rier, and symbol synchronization are assumed, since the 
purpose of the analysis is to quantify the effects of filter- 
ing only. 


II. The Split-Symbol Moments 
Estimator (SSME) 

The input to the SSME consists of a sequence of sam- 
ples obtained by processing the received signal. The pro- 
cessing can be broken into two stages: analog and digital. 
In the ARX II, for instance, the received signal is first 
downconverted to an intermediate frequency and filtered 
using analog circuits. This is followed by an analog-to- 
digital converter after which the carrier, subcarrier, and 
symbol synchronization are performed digitally. It is as- 
sumed that the synchronization function is performed in 
an ideal fashion and is thus ignored in this article. Figure 1 
depicts a basic model for the telemetry signal processing 
up to the SSME input. The analog filtering function is 
lumped into a single equivalent filter H{s) and the dig- 
ital processing is represented by the filter H(z). Given 
symbol timing, the SSME estimates the symbol signal-to- 
noise ratio, which is then used to assess the performance 
of the overall receiver. Quantifying the performance of 
the SSME, especially at high data rates, enables the user 
to determine the source of erroneous SNR measurements. 
These erroneous measurements may be attributed to the 
fundamental limitations of the SNR estimator or to the 
various tracking loops not being in-lock. This is particu- 
larly important for monitoring the symbol synchronization 
loop, as the only indication of its proper or improper oper- 
ation is provided by the estimates of symbol SNR available 
from the SSME. In order to simplify the problem, it is as- 
sumed that the digital filter H(z) is dominating and the 
effects of the analog filter are ignored. This is not the case 
in the ARX II, as both filters contribute equally. However, 
dealing with one filter will provide some insight into the 
behavior of the SSME in the presence of ISI. 

A. Signal Model 

Assuming that the carrier and sub carrier synchroniza- 
tion have been performed ideally, the baseband signal r(f) 
can be modeled as 


r(t) = y/(S)d(t) + n(t) (1) 


where S is the data power and d(t) denotes the actual data 
given by 

oo 

d(t)= £ d n p(t — nT) (2) 

n = — oo 

where p(t) is the baseband pulse limited to T sec (data 
rate R = 1/T) and d n denotes the binary symbols ±1. The 
additive noise n(f) is zero-mean white Gaussian noise with 
two-sided power spectral density (PSD) No/2 watts/Hz. 
The continuous signal r(t) is then sampled once every T s 
seconds (sampling rate f s — 1/T,) to produce the sequence 
r,j, as shown in Fig. 2. Consequently, the samples are 
given by 

r ij = \ZiS)dij + n >j (3) 

where dij takes on the value ±1 with equal likelihood and 
nij is a sequence of independent identically distributed 
zero mean Gaussian random variables with variance 
Nq/ 2T s . The double subscript ij denotes the ith sample 
of the jth symbol. In this case the digital samples r ^ are 
then filtered by a general filter H{z) to produce the string 
of samples y,-j, which form the input to the SSME. The 
digital filter H(z) can be either a FIR or an HR filter. In 
either case, the analysis to follow is applicable. Assume an 
integer number N s of samples per symbol. This is a valid 
assumption when the sampling clock is controlled by the 
symbol synchronization loop to produce an even number 
of samples per symbol as in ARX I, (the previous version 
of the advanced receiver) [4]. However, in ARX II this is 
not the scenario, as the sampling clock is free running and 
is not synchronized with the symbol rate. As a result, the 
number of samples per symbol can change and hence, is 
not fixed. It is not the goal of this article to investigate 
the effects of noninteger samples per symbol on the per- 
formance of the SSME. Thus, only the integer number of 
samples per symbol case is considered. In writing Eq. (3), 
it is further assumed that the pulse p(t) is an NRZ or a 
Manchester pulse. In either case, dij remains constant (1 
or —1) for N s or N s /2 samples. By including an additional 
weighting factor, more general pulse shapes can also be ac- 
commodated. Note however that when the baseband pulse 
p(t) contains nonzero rise or fall times due to analog fil- 
tering, the sampling offset (defined as the distance in time 
between the start of the pulse and the time at which the 
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first sample is taken) becomes an additional variable and 
should be considered in a performance prediction analysis. 


B. The SSME Structure 

In this section, the SSME is introduced and discussed 
briefly. Key performance parameters are also defined to 
evaluate the SSME performance in the presence of ISI. To 
the authors’ knowledge, the SSME was first suggested by 
L. Howard [1] and later analyzed in white Gaussian noise 
by Simon and Mileant [2]. 

The SSME structure is depicted in Fig. 3. The up- 
per “arm” sums the input samples yij over the first half 
of a symbol to produce Y*i, and the lower “arm” sums 
the input samples over the second half of a symbol to pro- 
duce Y 0j . Subsequently, Y a j and Ypj are first summed and 
squared to produce X ss j , an estimate of total power, and 
secondly multiplied to yield X p j , an estimate proportional 
to signal power. Next, X ss j and X p j are averaged over n 
symbols to obtain better estimates, namely, m ss and m p . 
Finally, m p is appropriately weighted and subtracted from 
m ss to obtain an estimate of noise power. This is then 
used with the signal power estimate to obtain an estimate 
of the symbol signal-to-noise ratio. 

In order to evaluate the performance of the SSME, 
the SNR should be defined at various points in the system 
(see Fig. 2), in particular, the symbol SNR before the filter 
and the symbol SNR at the output of the filter which forms 
the input to the SSME. Obviously, it is expected that the 
SNR estimates of the SSME will coincide with the symbol 
SNR of the filtered signal, as the latter constitutes the 
input to the SSME. As the data rate is increased and the 
number of samples per symbol is reduced (assuming an 
approximately fixed sampling clock), it is expected that 
the SSME estimates will diverge from the symbol SNR; 
i.e., there is a data rate threshold above which the SSME 
breaks down. 

The symbol SNR most commonly used in telecommu- 
nications is defined as the ratio of symbol energy E s to the 
one-sided PSD level No, i.e., SNR = E s /No. For rectan- 
gular symbols, E 8 — ST. This can also be expressed in 
terms of the symbol noise variance as SNR = S/2<r„ where 
= No/2T . In terms of Fig. 2, this is the symbol SNR 
at the input to the filter. The symbol SNR at the output 
of the sum- and-dump filter, denoted by SNR f (' denotes 
filtered), can be approximated by 


SNR' « 



(4) 


where S' and <j\, denote the average signal power and the 
noise variance at the output of the sum-and-dump filter, 
respectively. Equation (4) is an approximation because it 
does not include the “self noise” due to ISI (denoted 0 / 5/ ) 
from adjacent symbols. An exact definition of SN R' would 
include <t] si in the denominator of Eq. (4). However, in 
the case of low SNR, <t\i is much larger than ° J 5/ 1 and 
the latter can be ignored. On the other hand, at high 
SNR, a] SI dominates and Eq. (4) is invalid. In DSN ap- 
plications, the signal is typically dominated by noise and 
not by ISI, hence, Eq. (4) is valid. Also note that the 
sum-and-dump filter is not a “true” matched filter. As a 
result, SNR' is not optimized. It is shown in Section I of 
the Appendix that SNR' as approximated in Eq. (4) can 
be expressed as 


SNR' = 


oo /N s - 1 
*=- oo \ *=0 


s £ £ h p (i + kN , ) 


N s - 1 N ,- 1 

2 II Rn '^ - 

* = 0 m = 0 


(5) 


where h p (i), i = 0, 1 , . . ., is the pulse response of the digital 
filter and R n '(k) is the autocorrelation function of the fil- 
tered noise n'(m), defined by Rn*(k) = E[n'(m)n'(m + k)\. 
The indices of h p ( ) and R n ( ) correspond to sampling 
invervals. The double summation in the denominator ac- 
counts for the correlation between the ith noise sample and 
the mth noise sample of the same symbol. The summation 
over k in the numerator incorporates the contribution of 
ISI from all symbols. Strictly speaking, this sum is infinite 
for an HR filter and finite for an FIR filter. In either case, 
the sum can be truncated to include only those symbols 
whose ISI contribution is significant. In the absence of fil- 
tering, h( 0) = 1 and h(i) = 0 for i ^ 0. Consequently, 
the autocorrelation R n '(k) = ( No/2T s )[S(k ) * h(k) * /i(fc)] 
simplifies to (No/2T s )S(k) and Eq. (5) reduces to ST/N 0 , 
as expected. 


C. The Performance of the SSME Algorithm 

In this section, the performance of the SSME is evalu- 
ated in the presence of ISI. All results derived herein reduce 
to their counterpart for a wideband channel as in [2]. The 
input samples to the SSME are given by 


oo 

yij = VS dj- m h p (i + mN,) + n' ; - (6) 

m= — oo 
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where are the filtered noise samples. We chose to rep- 
resent yij in terms of the pulse response h p (n) to save the 
additional summation present when written in terms of the 
impulse response fi(n). In any case, h p (n) is related to the 
impulse response h(n) via 


Ns - 1 

M») = Y A ( n ~ m ) ( ? ) 

m=0 

From Fig. 2, the input samples are summed over 
the first half of a symbol to form the random variable n* 
samples in the second half form Ypj , i.e., 


Note that m 88 and m p change at one nth of the data rate 
and n is a design parameter. Finally, an SNR estimate, 
SNR *, is obtained by scaling and combining m p and m 88 
as follows: 


SNR* 




2 (|m„ - m p ) 


( 11 ) 


Equation (11) describes the SSME SNR estimator which 
has been designed to operate in a wideband channel. De- 
pending on its performance in the presence of ISI, modi- 
fications might be required to adopt the estimator for use 
in band-limited channels. 


(Ns/ 2)-l 

Y*j = ^2 

i = 0 


(8a) 


The mean and variance of SNR * have been derived 
in [2] and are repeated below for convenience, 


N $ - 1 

Y 0j = Y w 


i=N,/2 


(8b) 


E [SNR*] = SNR + i (l + 2SNR ) (12) 


As discussed earlier, we form from the half-symbol accu- 
mulators the product and the sum squared of the outputs 
to get 


X s . j =(Y aj +Y /}j ) 2 (9a) 

X P j=Yp r Y aj (9b) 


Var [SNR*] - ^ (l + 4 SNR + 2 SNR 2 ) 


where 


SNR = 


E\™p\ 

2 [\E[m„] - E[m p ]\ 


(13) 


(14) 


These variables are the estimates of total power and signal 
power on a per symbol basis. To improve these estimates, 
we average over n symbols and obtain 

m " = \Y x »i ( 10a ) 

j= i 

m P = iY x p> ( 10b ) 


Equations (12) through (14) have been computed in the 
presence of AWGN and assume no ISI. Furthermore, they 
are approximate expressions which are valid for large n. 
Improved approximations could be obtained by retaining 
higher-order terms of the expansion as discussed in [3]. In 
the presence of filtering, these equations are not valid in 
the strict sense but can still serve as a guide, the quality 
of which is assessed via computer simulations at a later 
point. They are expected to be “good” approximations 
because we are operating in a noise-dominant rather than 
ISI-dominant scenario. It is shown in the Appendix that 


E[m p ] = S Y 

m= — oo 


(JV,/2)-l 

h p (i + mN.) 

1=0 


Ns - 1 

Y h p( k + mN >) 


k=N,/2 


(N,/ 2)-l N,-l 

+ Y Y *»'« -* ) 

1=0 k=Ns/2 


(15a) 
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and 


E[m„] = S 53 


Ni—l 

53 h p {i + mN,) 
L *=o 


T 2 


Ns - lNs -1 

+ E E «•■«-*) 

•=0 *=0 


(15b) 


which gives 


SNR—— 


oo 

*E 

— oo 

’(Ns/2)-l Ns- 1 

^ /i P (i + mJV 5 ) ^ /i p (HmiV,) 

i=0 k-Ns/2 

(N s / 2)-l N a - 1 

+ 53 E 

» = 0 k-Ns/2 

2 i 

4 

\ ms- oo 

'N a - 1 

+ mA/*) 

. »=o 

Ns-lN a -l \ 

2f53 53« n -(^) 

1 = 0 *=0 / 

oo An,/2)-1 .N ,- 1 \ 

-553 ( 5 v />p('+ m ^) 53 m* +»**.> 

m= — 00 y »=0 At=Nj/2 j 

(JV,/2)-l JV.-l 

- E E H "' (, ’ fc) 

.=0 fc=Nj/2 


(16) 


All these equations reduce to their respective counter- where 
parts in a wideband channel as in [2]. Namely, E[m p ] = 
N*S/4,E[m„] = AT, 2 5 4* N S N 0 /2T S and SNR = 5T/A 0 . 

Since the estimate 5iV/Z* itself is a random variable, a 
measure of its quality is its signal- to-noise ratio given by 


(N$/ 2)-l Ns — 1 

Ao= E E *»'(*-*) 

i'=0 k=N,/2 


(19a) 
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Using the approximations for £[SN/i*] (Eq. 12), and 
VartSW#*] (Eq. 13), one obtains 


a 00 = s 53 
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(19c) 
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Note that in either limit, as SNR approaches 0 or oo, one 
has 


Boo = s 53 


N,-l 

53 h p (i + mN,) 
i = o 


(19d) 


and 




SWKi = MPI^) i = 0 '“ (18) 


lim SNR(SNR*) 
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is given by Eq. (17b) with SNR replaced by its respec- 
tive limit. In the absence of filtering, Ao = 0, Bo = 
(N s No/2T s ), Aoo = (N s /2) 2 and = N 2 } which simpli- 
fies to 


lim SNR(SNR m ) = - (20a) 

SNR-0 v n 

lim SNR(SNR*) ~ J + 2 (20b) 

SNR— oo 2 


in agreement with [2]. Equations (18) and (19) depend 
on the filter# (z) and cannot, in general, be simplified any 
further. Intuitively, however, SN R(SN R*) is expected to 
improve with n for high SNR even in the presence of ISI. 

III. Numerical Results and Discussion 

As mentioned earlier, the performance of the SSME 
needs to be assessed in the presence of ISI for various data 
rates. The digital filter used in the simulations is the half- 
band filter described in [5] with transfer function H(z) 
given by 

H(z) = - 0.017 + 0.038z -2 - 0.089z~ 4 
+ 0.312z -6 + 0.5z -7 + 0.312z -8 
- 0.089z -1 ° + 0.038z -12 - 0.017z -14 (21) 


and plotted in Fig. 4 versus normalized frequency. Note 
that the ratio of 3-dB bandwidth to sampling frequency is 
about 0.23. Assuming an effective sampling rate of 20 MHz 
(actual sampling rate is 40 MHz but telemetry process- 
ing at the output of the half-band filter is performed at 
20 MHz due to decimation by two), the number of sam- 
ples per symbol is given by N s = f s T. Thus, defining N s is 
equivalent to specifying the data rate (for a fixed sampling 
rate). To get a feeling of the filtering involved, Fig. 5 de- 
picts the pulse response and Fig. 6 shows a typical filtered 
sequence for N s = 2, 4, 10 in the absence of noise. Note 
from Fig. 5 that a filtered pulse experiences interference 
from both past and future symbols due to the inherent 
filter delay. 

The performance of the SSME in the presence of fil- 
tering is depicted in Fig. 8 versus the number of samples 
per symbol N s . The figure shows analysis and simulation 
results for input SNRs of —5, 0, and 5 dB. As N s de- 
creases, the data rate increases and more ISI results. As 
N s decreases below 6 , SNR * diverges from SNR' and the 
SSME breaks down. Surprisingly, the estimated SNR * 
are several decibels higher than SNR' in the breakdown 
region. However, it is unclear whether the divergence of 
the estimates is due to the ISI or colored noise. In order 
to gain more insight into the causes, the effects of ISI and 
colored noise are considered separately. Figures 10 and 12 
depict the SSME response when the data or the noise are 
filtered, but not both. When the data is filtered (Fig. 10), 
the estimated SNR, denoted by SNR d , is still valid even 
for N s < 6. In this special scenario, Eqs. (5) and (16) 
reduce to give 
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and 
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The subscript d is to indicate that only the data is filtered. 
Both equations are shown in Fig. 10 along with simulation 
points. As N s decreases, SNR f d and SNR* d decrease be- 
cause more signal energy is lost due to filtering. Also note 
that at N a = 2, SNR * and SNR f coincide better at low 
SNR and they diverge for high SNR. The curvature of the 


graph in Fig. 10 is opposite to that of Fig 8. That tends 
to indicate that ISI itself is not the reason why SNR * of 
Fig. 8 increases for low N s . This phenomenon is better 
explained by Fig. 12, which depicts the SSME response in 
the presence of correlated noise, but unfiltered data. In 
that case, Eqs. (5) and (16) simplify to 


SNR' = 


SN? 

N s - 1 N s - 1 

2 E E Rn'(i-m) 

* = 0 m=0 


(23a) 
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where the subscript n indicates that only noise is filtered. 
For N $ < 6, the SSME seems to break down due to noise 
correlation. Recall that the SSME estimates noise power 
by subtracting signal power estimate from total power esti- 
mate. The signal power estimate X p j (see Fig. 3) is formed 
by the product of the accumulator outputs. In the absence 
of filtering, that product contains independent noise sam- 
ples whose effect is reduced by averaging over n symbols. 
In the presence of filtering and specifically for low N a , the 
noise samples in that product are highly correlated and 
contribute to the signal power estimate even after averag- 
ing. That in turn reduces the noise power estimate after 
subtracting from the estimate of total power, hence signif- 
icantly increasing SNR£. Obviously, the noise correlation 
also increases the total power estimate but that increase is 
still less than its counterpart in estimating signal power, 
taking into account the appropriate weighting. In Figs. 13 
and 14, SNR * is plotted versus SNR (Fig. 13) and SNR' 
(Fig. 14) for a fixed N s when both the data and noise are 
filtered. As expected, a linear relationship is obtained for 
large values of N $ and a nonlinear relationship as the data 
rate increases. 

Finally, as an indication of the quality of the esti- 
mates, Figs 15 and 16 depict SNR(SNR*) versus SNR for 


a fixed N s (Fig. 15) and fixed n (Fig. 16). As more symbols 
are used to provide an SNR estimate, an improved perfor- 
mance is expected and this is clearly shown in Fig. 15. 
From Fig. 16, it is also clear that SNR(SNR*) improves 
with decreasing N s . This is a rather surprising result at 
first glance. But remember that SNR(SNR*) is an indica- 
tion of how the estimate jitters about its mean, no matter 
where the mean is. Also recall that for N s = 4 the defi- 
nition of SNR in Eq. (4) is invalid as the variance due to 
ISI might be significant. The simulations were conducted 
using 25,000 symbols for N s > 10 and 50,000 symbols for 
N 8 < 10, with varying n depending on the SNR. 


IV. Conclusion 

This article deals with the performance of the SSME 
in the presence of ISI, evaluated both analytically and by 
simulation. It is shown that the estimator performs well 
with more than 6 samples per symbol, but poorly with 
fewer than 6 samples per symbol due to the high corre- 
lation among noise samples of the same symbol. Further 
work would be required to reduce the bias in the estimates 
that occur with few samples per symbol. 
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Fig. 1. General signal processing model. 



Fig. 2. Signal model. 



Fig. 3. The SSME structure. 



Fig. 4. Digital filter frequency response (20-MHz 
processing rate). 




NORMALIZED TIME (t/T) 


Fig. 5. Pulse response of the half-band filter. 
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Fig. 10. SNR. SNR'd and SNR d versus N s (data filtered). 



SNR 


Fig. 11. System model for Fig. 12. 















Fig. 16. SNR(SNR*) versus SNR (varying N Sl fixed n). 
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Appendix 

Derivation of Equations 


I. Derivation of Equation (5) of the digital filter. Let s' ; - be the signal portion of y t j. 

Clearly then, 

The approximate symbol signal-to-noise ratio at the 
output of the digital filter, SNR', is given by 


SNR! = 



OO 

(A-l) 8 , ij = VS E dj- m h p {i + rnN,) i = 0, 1 , 2, ..., N, — 1 

m=—oo 

(AS) 


where S' and cr\, denote the average signal power and the 
noise variance respectively at the output of the sum-and- 
dump filter. We begin our derivation of the signal power in 
the filtered symbol with an expression for the ith sample 
of the jth filtered symbol, . Namely, 


Vij = Vs\djh p (i) + dj-ih p (i + N,) 


and the signal power, S', is 


S' = E 



(A-4) 


-hdj_2^p(* + 2Af,)J + V n ij 

t = 0 , 1 , 2, . . . , AT, — 1 


where E[ ] is the expected value operator to average over 
the random data. Substituting Eq. (A-3) into Eq. (A-4) 
yields 


OO 

= Vs E dj_ m hp(i + mN,) + n' ; 

m= — oo 

* = 0, 1,2, .... N.-l (A-2) 


S' = E 


( N,-l oo \ 2 

VSJ2 E di- m hp(i + mN .) ) 

t=0 m=-oo / 


where dj is the jth received symbol, h p (i) is the pulse Separating the above equation into four sums and moving 

response at time *, and is the noise sample at the output the expected value operator inside the summations yields: 


S' 


N t -1 N 8 -l oo 
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where the last step follows since E[dj- n dj- m ] = 6(n — m) 
for all j. Using Eq. (7), S' can be expressed in terms of 
the impulse response as 



h(i + nNt 




(A-6) 


Substituting for y S j from Eq. (A-2) in Eq. (A-9) gives 


E[X tgJ ] = E 


N s - 1 / oo 

E dj-mhp ( i + mN s ) 

t=0 \m= — oo 





(A-10) 


The filtered noise variance on a per symbol basis, is 
defined as the expected value of the squared sum of N, 
noise samples, i.e., 



Ns-lNs-l 

= E E *»'(*-*) 

t=0 /=0 


(A-7) 


where is the autocorrelation function of the filtered 

noise samples. Using Eqs. (A-5)and (A-7) in Eq. (A-l), 


SNR! = 
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II. The Expected Value of X ss , X pt m ss , 
and nip 

As indicated in Fig. 3, the random variable X 99 j is 
the squared sum of N 9 input samples. Hence, the expected 
value of X 89 j is 


Carrying out the square in the above equation, moving 
the expectation operator through the summations, and 
noting the following: = S(m — £) for all j, 

E[ a j-m n ij\ = 0 for any i, j, and m, and £[n^n^] = 
Rn*(i — it) for all j, the desired result is obtained, namely, 


oo 

E[X„i\ = S £ 

m= — oo 


■JVj-1 


TV. — 1 JV.-l 

+ ££*»'(»-*) ( A - n ) 

*=o *=0 


Note that is independent of j and henceforth the 

subscript j can be dropped from E[X 99 j] and the expected 
value of X 99 j can be denoted by X 99 . As shown in Fig. 3, 
the random variable Xri is the product of the sum of sam- 
ples from the first half of a symbol with the sum of samples 
from the second half of a symbol. Consequently, the ex- 
pected value of X p j is given as 


E[X pj ] = E 


(N,/ 2)-l N,-l 

£ 

1=0 lc=N,/2 
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E[X,.j] = E 
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Substituting for from Eq. (A-2), 
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Expanding the product in the equation above and taking 
the expectation of the received data symbols, filtered noise, 


as well as the product of data and noise yields the desired 
result: 


E[Xpj) = S £ 

m=— oo 


'(Af,/2)-l N,-l 

53 hp(i + mN,) 53 h{ k + mN ») 

i=0 k=N,/ 2 
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Note that E[X p j] is independent of j and henceforth 
the subscript j can be dropped from E\X p j] and the ex- 
pected value of X p j can be denoted by X p . The random 
variables m S8 and m p are formed by scaling and summing 
the random variables X st j and Xpj over n symbols. Con- 
sequently, the expected values of m st and m p are the ex- 
pected values of these scaled sums. But, since E[X ss j] and 


E[X p j] are 

constant from symbol to symbol (i.e. 

, indepen- 

dent of j), 




E[m, a ] = X„ 

(A-15a) 

and 




E[m p \ = X p 

(A-15b) 
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This article compares the measured DSN 70-meter antenna performance at S- 
and X-bands with the design expectations presented in Part I of this report. A 
discussion of natural radio-source calibration standards is given. New estimates of 
DSN 64-meter antenna performance are given , based on improved values of cal- 
ibration source flux and size correction. A comparison of the 64- and 70-meter 
performances shows that average S-band peak gain improvement is 1.94 dB , com- 
pared with a design expectation of 1.77 dB. At X-band, the average peak gain 
improvement is 2.12 dB, compared with the (coincidentally similar) design expec- 
tation of 1.77 dB. The average measured 70-meter S-band peak gain exceeds the 
nominal design-expected gain by 0.02 dB; the average measured 70-meter X-band 
peak gain is 0.14 dB below the nominal design-expected gain. 


I. Introduction 

This article is the second part of a two-part report [1] 
detailing the design performance expectations and com- 
parison with actual performance of the DSN 70-meter an- 
tennas in their upgrade from the original 64-meter con- 
figuration. The actual 70-meter antenna gain and noise 
temperature performance is documented in [2, 3]. 

Two of the important elements of the 64-70 meter an- 
tenna performance upgrade project were (1) to determine 


if the project specifications were met and (2) to quan- 
tify, with an estimated absolute accuracy, the actual ef- 
ficiency/gain levels achieved. The project specifications 
of 1.9-dB X-band and 1.4-dB S-band gain increases are 
“improvement” specifications and do not effectively quan- 
tify absolute performance levels; there remains the issue 
of quantitatively defining the baseline performance level 
of the 64-m antennas. In this situation, the DSN had 
three differently performing 64-m antennas [4-6]. For ex- 
ample, it was clear (but not specifically measured) that 
the Goldstone 64-m antenna had reflecting panels which 


were manufactured and/or adjusted better than those on 
the Madrid and Canberra 64-m antennas. Also, the Gold- 
stone and Canberra 64-m antennas had been previously 
upgraded with a gravity-distortion-reducing major struc- 
tural brace system which generally performed successfully, 
but was not accompanied with the exact required subre- 
flector active focus command programs. The Madrid 64-m 
antenna was further unique in that the structural brace 
upgrade was installed but certain superfluous older brac- 
ing was intentionally not removed due to time constraints. 
This combination was known to be counter-productive. 
Because of noncritical tracking requirements, this was an 
acceptable configuration for a short interim period prior 
to planned 70-m upgrade but, in fact, it produced an an- 
tenna system with degraded shape and nonoptimum sub- 
reflector focus performance. In short, the 64-70 meter 
upgrade project began with three quite differently per- 
forming “baseline” 64-m antennas, and these needed to 
be carefully calibrated to assist settling project baseline 
definition questions. 


II. Natural Radio Source Calibration 
Standards 

DSN antenna calibrations are wholly dependent upon 
viewing natural radio noise sources (radio galaxies, 
quasars) with estimated microwave flux densities. Ob- 
taining such calibrations by the alternative of overall deep 
space telecommunication link performance has proven a 
more difficult matter because of the great number of vari- 
ables involved. The absolute accuracy of natural radio- 
source flux densities has been estimated to be 0.50 dB, 
(3-cr) at X-band [4]. It was believed even with the difficult 
absolute accuracy issue at the half-dB level, at least very 
accurate comparison measurements would ensue, and the 
70-m calibrations would be no worse on an absolute basis 
than the 64-m calibrations. 

The 70-m antennas provided more detailed radio- 
source observation by virtue of much narrower antenna 
X-band beamwidth (0.0305 deg versus 0.036 deg). Nar- 
rower beamwidth results from both larger diameter and 
substantially higher area efficiency. Various sources which 
formerly appeared consistent in strength on the 64-m and 
smaller DSN antennas immediately appeared with incon- 
sistent apparent strengths on the new 70-m antennas — an 
indication of source size/beam width (source resolution) 


increase. Although never an issue at S-band, the DSN ra- 
dio source list “standards” document 1 in fact was shown 
inadequate at X-band for the very narrowbeam 70-m. The 
approximately 15-percent narrower X-band 70-m beam- 
width is about 0.0305 deg or 1.83 arcmin. This proved 
to better resolve the primary calibrator 3C274 and re- 
veal the previous inaccuracies, especially in the source 
size/beamwidth correction factor, but to a lesser degree 
in the absolute flux scale as well. 

Table 1 provides a summary of data corrections 
needed when using natural radio sources with these ex- 
tremely large, high-efficiency X-band antennas. The res- 
olution provided by significantly sharper (high-efficiency) 
70-m beamwidth, together with careful analytical work at 
JPL (M. Klein and P. Richter, private communication), re- 
vises the primary and secondary DSN calibrator sources, 
as detailed in Table 1. The effects on gain are shown in 
parentheses. 

Updated Table 1 values, and others, will be published 
in the Revision C version of JPL Document D-3801. Ac- 
cordingly, within this article, previously published [4-6] 
64-m X-band results are rescaled according to Table 1 val- 
ues. The 70-m X-band results herein are also based on 
Table 1. At S-band, due to wider beamwidth, no notice- 
able source size/beamwidth or flux effects were expected 
or determined. At S-band, previous DSN calibrations with 
National Bureau of Standards (now NIST) certified stan- 
dards have proven that natural radio-source S-band flux 
values are known to 0.22 dB, 3-cr [7]. 

A rigorous analysis of absolute accuracy presently at- 
tained in DSN antenna gain measurements using natural 
sources is not available. The three dominant error contrib- 
utors are natural source flux, instrumentation accuracy, 
and data repeatability. Tests for and corrections of these 
errors are applied in data reduction for pointing errors as 
needed and receiver linearity. It is believed the S-band 
flux accuracy is 0.22 dB and that the X-band accuracy is 
improved, due to the 70-m narrowbeam experience, to not 
worse than 0.40 dB, both numbers quoted at high (3-cr) 
confidence. It is believed that the instrumentation accu- 
racy is 0.15 dB, and that data repeatability is generally 
0.09 dB, both 3 -<r. Taking these dominant error sources 

1 DSN Radio Source List for Antenna Calibration, JPL D- 
3801, Rev. B (internal document), Jet Propulsion Laboratory, 
Pasadena, California, September 25, 1987. 
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as independent, it is reasonable to accept overall abso- 
lute accuracies of 0.28 dB (S-band) and 0.44 dB (X-band), 
3-cr, for normally appearing data sets. At both frequency 
bands, it is significant that the natural-source flux uncer- 
tainty is the dominant component of the total measure- 
ment uncertainty. 


III. Adjustment of 64-m Antenna 
Calibrations 

All three 64-m antennas were calibrated at S- and X- 
bands within a month or two prior to removal from service 
for 70-m conversion. DSS-63 was calibrated in July 1986; 
DSS-43 in January 1987; and DSS-14 in September 1987. 
The JPL DSN radio-source flux standard D-3801, Rev. A, 
was in effect throughout the DSS-43/63 period, and Rev. B 
during the DSS-14 period. Consistency in the use of equip- 
ment, technique, and standards was sought and is believed 
to have been achieved. The results are initially reported 
in [4-6]. 

In view of the natural radio source size and flux cor- 
rections, the 64-m calibrations are now adjusted in order 
to allow accurate comparison with the 70-m results. DSS- 
14 64-m X-band results are increased -fO.313 dB, chang- 
ing the X-band peak efficiency and gain values in [6] from 
0.498 (+72.01 dBi) to 0.535 (+72.33 dBi). DSS-43 is ad- 
justed +0.329 dB, changing the X-band peak efficiency 
and gain values in [5] from 0.454 (+71.61 dBi) to 0.490 
(+71.94 dBi). DSS-63 is also adjusted +0.329 dB, chang- 
ing the X-band peak values in [4] from 0.451 (+71.58 dBi) 
to 0.487 (+71.91 dBi). All X-band results apply at a fre- 
quency of 8420 MHz, and the peak-to-peak spread, 
0.42 dB, was rather large. 

At S-band, the 64-m antennas yielded an average of 
58.2 (+1.2/ - 0.8)-percent area efficiencies (61.36 dBi at 
the test frequency of 2285 MHz, or 61.40 dBi adjusted 
to 2295 MHz for comparison with 70-m results). DSS-14 
peaked at 59.4 percent, DSS-43 at 57.7 percent, and DSS- 
63 at 57.4 percent, a peak-to-peak spread of 0.15 dB. 


IV. 64-m Calibrations Compared 
With Design Expectations 

Figure 1 shows the 64-m antenna X-band gain with el- 
evation angle response, adjusted as described above. Both 


DSS-14 and DSS-43 exhibited generally flat response, not 
unlike the design-expected shape, albeit everywhere be- 
low the expectation. DSS-63 clearly exhibited the interim 
dual-brace arrangement, known to degrade antenna gain 
response away from the setting angle. The reader is re- 
minded that the 64-m design-expected peak value is based 
on 1.14-mm rms (normal) small-scale surface roughness. 
This was the original design value and was likely never 
achieved. Limited holographic imaging on the DSS-63 64- 
m antenna indicates the small-scale effective roughness was 
near 1.7-mm rms, consistent with the actual performance 
described in [4], although the DSS-14 64-m antenna was 
clearly better, perhaps 1.35-mm normal rms. 

Two X-band design expectations were discussed in 
Part I [1] of this article. The first is considered a “nomi- 
nal” expectation, in terms of likelihood, and the second is 
considered an “estimated minimum-likely” value. Both are 
highly dependent on the small-scale roughness term, which 
was poorly known for the 64-m antennas. The procedure 
adopted to form the estimated minimum-likely value, as 
discussed, takes one-third of the estimated peak limits. 
This is equivalent to the nominal estimate, minus approx- 
imately l-<7. In the 64-m case, this is believed to lead to 
overestimating the real performance to be expected. If 
high-resolution holographic imaging of all 64-m antennas 
was available, the estimates could be improved; however, 
such imaging is not available on all those antennas. Ac- 
cordingly, in the absence of further quantitative informa- 
tion, the original design rms, the nominal design-expected, 
and the estimated minimum-likely 64-m efficiency values 
must be accepted. Comparisons with measurements are 
shown in Table 2. 

The S-band average measured difference from esti- 
mated minimum-likely is 0.07 dB. The X-band difference 
is 0.05 dB at DSS-14 and about 0.38 dB at DSS-43 and 
DSS-63. Clearly, DSS-43 and DSS-63 had surfaces whose 
roughness was larger than both the original specification 
and the one-third peak value adopted in the minimum- 
likely estimating procedure. It requires 1.24 dB of small- 
scale roughness loss to reconcile DSS-43/63 X-band 64-m 
actual performance with expectations. This indicates an 
effective normal surface roughness of 1.6 mm (0.063 inch), 
not an unreasonable value. At S-band, it should be noted 
that the DSS-14 64-m antenna in particular met and ap- 
peared to slightly exceed the estimated minimum-likely 
efficiency (0.594 versus 0.591). 
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V. 70-m Antenna Calibrations 

Immediately upon conclusion of final upgrade struc- 
tural work, the 70-m antenna calibrations commenced. In- 
clement weather at DSS-63, the interim fiberglass subre- 
flector at DSS-43, and other factors delayed completion 
until August 1988. Early informal performance report- 
ings, especially for X-band, were gauged against the then- 
existing D-3801, Rev. B flux standard document, even 
though the very narrowbeam 70-m antennas immediately 
revealed inconsistencies among the X-band calibration 
sources. In the following discussion, the revised flux and 
source size/beamwidth corrections discussed earlier are 
properly used. The 70-m X-band peak efficiencies and 
gains as reported in [2] are: DSS-14, 68.4 percent (74.17 
dBi); DSS-43, 67.3 percent (74.10 dBi); and DSS-63, 70.1 
percent (74.28 dBi). These values are more closely grouped 
than the three 64-m antenna X-band performances were. 
The DSS-14 peak occurs at 46-deg elevation, DSS-43 at 
46-deg elevation, and DSS-63 at 47-deg elevation. 2 The 
standard deviation of DSS-14 efficiency data (318 points) 
is 0.7 percent, or 0.04 dB. The DSS-43 data standard de- 
viation (540 points) is likewise 0.7 percent. The DSS-63 
standard deviation (176 points) is 1.0 percent, or 0.06 dB. 

The 70-m X-band gains as functions of elevation an- 
gle are shown in Fig. 2. The design-expectation curve 
has a peak value as determined in Table 4 of [1] and a 
shape determined from GTD calculations, which consider 
main reflector structural deformation due to gravity load- 
ing. The DSS-14 elevation function is seen to follow the 
design-expected elevation function, albeit 0.15 dB low at 
the adjustment angle and less than 0.2 dB low from 0 to 
80-deg elevation. Both DSS-43 and DSS-63 exhibit eleva- 
tion functions which are considerably steeper than design 
expectations. 

In view of the very small standard deviations in the 
measured data, the measured X-band elevation functions 
are judged statistically significant and indicate a real dif- 
ference in structural or focus performance among the 70-m 
antennas. It should be noted in Fig. 2 that small atmo- 
spheric loss terms have been removed from the observed 
data, according to on-site meteorological measurements 
taken during the tests. In short, the data in Fig. 2 are 

2 The main reflector surfaces were adjusted based on holo- 
graphic imaging as follows: DSS-14 at 47.0 deg, April 1988; 
DSS-43 at 47.3 deg, June 1988; and DSS-63 at 41.8 deg, July 
1987. 


as self-consistent as can be expected, and the DSS-43 and 
DSS-63 antennas indeed behave alike and somewhat differ- 
ently with elevation angle than does the DSS-14 antenna. 

At S-band, limited 70-m data were collected, in fa- 
vor of the more revealing X-band data. At S-band, the 
peak efficiency is 0.761 (+63.34 dBi at 2295 MHz), and 
the elevation response is essentially flat, as expected. Fig- 
ure 3 shows the average result from two antennas, DSS-14 
and DSS-43. The DSS-63 S-band 70-m data were self- 
inconsistent for reasons not understood, therefore judged 
invalid. Operationally, it is not expected that DSS-63 will 
perform differently from the other stations at S-band, and 
indeed no significant difference has been observed. 


VI. 70-m Calibrations Compared With 
Design Expectations 

Table 3 summarizes the nominal design-expected and 
estimated minimum-likely 70-m efficiency values and mea- 
surements. The “best” X-band antenna, DSS-63 with 
0.701 efficiency, is 0.09 dB above the measurement aver- 
age while the “worst” antenna at 0.673 efficiency is 0.08 dB 
below the measurement average. In reality, these are ex- 
ceedingly small differences, and the qualitative descrip- 
tions above should not be interpreted too literally. At 
X-band, all antennas are below the nominal design expec- 
tation by an average of 0.14 dB. The S-band average is 
0.02 dB above the nominal design expectation. 

The improved agreement in 70-m actual performance 
against design expectations is directly traceable to im- 
proved knowledge gained through use of high-lateral- 
resolution microwave holographic imaging. The hologra- 
phy illumination amplitude image gives a clear view of 
quadripod shadowing while the phase image gives an ex- 
cellent estimate of small-scale surface roughness. All 70-m 
antennas yielded high-quality holographic data and the 
nearly 1,300 individual reflector panels on each antenna 
were adjusted according to those data. 

In actuality, all 70-m antenna surface (and focus) 
adjustments were seriously time-compromised. The high- 
resolution holographic data type inherently provides 
panel-setting data based on individual-panel 3-degree-of- 
freedom best fits, within a global best-fit reference system. 
Rather precise information (approximately 50 microns or 
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0.002-inch repeatability) for each panel corner is provided. 
In the time-constrained 70-m project a compromise was 
necessary. At each intersection of four panel corners, the 
necessary individual settings, which are in general differ- 
ent, were simply averaged. Thus, all panels are, in general, 
misset and in fact very slightly twisted. Further, panel cor- 
ners in average error by 0.015 inch (0.37 mm) or less were 
not adjusted at all. This is clear from the resultant 0.7- 
mm rms small-scale roughness achieved compared with the 
initial Project goal of 0.5-mm rms, and further compared 
to the holography-determined theoretical minimum (given 
perfect adherence to setting directions) of 0.3-mm rms. An 
rms of 0.3 mm is effectively the manufacturing precision 
of the panels and subflector, as independently determined 
during manufacture, plus a small incoherent holography 
data-system noise component. If time had permitted re- 
finement of the 0.7-mm quick setting to a practical 0.4 mm, 
the X-band gain would increase 0.13 dB, changing the av- 
erage measured 68.6 percent efficiency to 70.7 percent. At 
a higher frequency such as 32 GHz, the gain difference de- 
veloped by improving a 0.7-mm surface to 0.4 mm is nearly 

1.9 dB. 

VII. Gain Performance Increases 
A. S-Band 

Tables 1 and 2 of [1] give the design-expected S-band 
gain increase (70-m over 64-m) as +1.77 dB. From mea- 
surements, the efficiencies determined are 0.582 (+61.40 
dBi at 2295 MHz) for the 64-m antennas and 0.761 (+63.34 
dBi) for the 70-m antennas. The measured increase is thus 
+1.94 dB. Both 70-m measurement sets show very small 
differences between the antennas. The Project specifica- 
tion (+1.4 dB) was, of course, based on predesign esti- 
mates. Detailed design extracted 0.5 dB more gain than 
initial estimates for the 70-m antennas. 


B. X-Band 

Tables 3 and 4 of [1] give the design-expected X-band 
gain increase (70-m over 64-m) as +1.77 dB. However, this 
is based on the originally specified 64-m surface tolerance 
of 1.14 mm which is unrealistically low, as discussed before. 
Likewise, the estimated minimum-likely efficiencies in Ta- 
ble 6 of [1] for 64-m X-band (0.533 for DSS-43/63 and 0.541 


for DSS-14) are considered too high, due to the adopted 
one-third peak estimating method used. The measure- 
ments (simply averaged) show +74.18 dBi for 70-m and 
+72.06 dBi for 64-m antennas. On this simple averaged 
basis, the 70-m upgrade project provided +2.12 dB X-band 
gain improvement at or near the adjustment (peak gain) 
elevation angle. The best 64-m antenna, DSS-14, provided 
+72.33 dBi. After upgrade to 70-m, DSS-14 now provides 
+74.17 dBi. On this single antenna basis, the 70-m up- 
grade project provided +1.84 dB at DSS-14, the minimum 
improvement realized at X-band. For DSS-43 and DSS-63, 
the average X-band gain increase provided by the 70-m 
project is +2.27 dB. These improvements clearly exceed 
expectations simply because the 64-m surface tolerance 
was somewhat worse than our limited previous knowledge 
and estimates. 

Table 4 presents a summary of the S- and X-band 
64- and 70-meter antenna gains and efficiencies, plus the 
increases gained by the upgrade activities. Each reader can 
(and perhaps will) select his or her baseline of choice in this 
matter of three quite differently performing 64-m X-band 
antennas at the start of the project, and three very slightly 
differently performing 70-m X-band antennas at project 
conclusion. Nevertheless, perhaps the best value to use 
is the simple average of the before- to- after measures, that 
is, +2.1 dB at X-band. The specification (+1.9 dB) was, 
of course, based on predesign estimates. Detailed design 
extracted a bit less than the predesign X-band estimate 
from DSS-14. The necessary stovepipe feed and surface 
setting compromises, both of which are recoverable, would 
boost DSS-14 improvement from +1.84 to +2.08 dB with, 
of course, similar increases to the other 70-m antennas. 

Figure 4 presents 64-m and 70-m measured X-band 
data for comparison. Included in Fig. 4 are the 64-m and 
70-m design expectations (nominal, not estimated mini- 
mum-likely values). The peak design expectations are ob- 
tained directly from Tables 3 and 4 of [1]. The design- 
expected elevation functions are obtained from finite- 
element structural gravity distortion data, analytically 
smoothed for input to the GTD scattering program [8-10]. 
After smoothed GTD scattering a small-scale (nominally 
random) roughness term is added for the overall design 
estimate. Thus the large lateral-scale distortions are han- 
dled by direct integration, while the small-scale distortion 
is handled on the familiar statistical basis, according to 
the commonly used Ruze formulation [11]. This is the 
best technique known and possible at this time. Apply- 
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ing the simplified statistical method of Ruze is considered 
quite proper in this instance of 1,300 individual reflecting 
panels per 70-m antenna. 

In Fig. 4 it is clear that the 64-m and 70-m design 
expectations compared with DSS-14 data are in general 
agreement. The DSS-63 64-m antenna was not expected to 
follow the design elevation function due to the temporary 
dual structural bracing configuration as discussed. 

Both DSS-43 and DSS-63 70-m antennas do not 
presently follow the design-expected elevation function. 
Whether this is due to a real structural deficiency, a subre- 
flector focus control deficiency, poor pointing during data 
collection, or other reasons is not known. The indications 
are that pointing was acceptable. Additional work is nec- 
essary to understand the DSS-43/63 elevation functions, 
and focus tests at high and low elevation angles are the 
likely priority. Separately, some very valuable holography 
data was obtained at low elevation angles at DSS-14 and 
to a lesser extent at DSS-63. These data, with a lateral 
resolution comparable with structural rib and hoop dimen- 
sions, may assist in detailed understanding of the gravity 
behavior of the DSN 70-m antenna structures. 


VIII. System Noise Temperatures 

No formal predictions of 64- to 70-m antenna upgrade 
noise improvement (or degradation) were made. The rear 
spillover noise, a component of the total, was expected 
to increase roughly 1 K at S-band, a consequence of the 
X-band optimum design (the former 64-m systems were 
S-band optimized). At X-band, the rear spillover was ex- 
pected to decrease roughly 0.5 K. The deeper main re- 
flector resulting from 64- to 70-m diameter increase, with 
approximately the same effective focal length, could be 
expected to slightly reduce noise with the 70-m antenna 
pointing below about 30-deg elevation and at zenith. The 
reduced quadripod shadowing might provide slightly re- 
duced noise at all elevation angles. An upper bound on 
this might be 1 K. 3 


3 The 64-m systems utilized a special beamshaping flange 
around the periphery of the hyperboloidal portion of the sub- 
reflector, which was optimized for S-band. In any event, such 
appendages tend to be narrowband on a scale of an octave, thus 
of limited utility for wideband antennas. 


Despite the above 70-m design tendencies, the JPL 
Technical Division and Project position was that the sys- 
tem noise temperature change, if any, was too close to 
quantify without significant additional work. This section 
will therefore briefly record measured noise results. The 
results presented here are slight modifications of the initial 
results presented in [3]. During the period of mechanical 
upgrade, some of the maser preamplifiers were serviced, 
and in some cases, outright exchanges were made. With 
at least twelve masers disposed among three stations, such 
servicings and exchanges require sub calibrations and be- 
come a difficult measurement, bookkeeping, and report- 
ing task. Also during the period of mechanical upgrade, 
the X/S-band dichroic plates were serviced and upgraded. 
The paint in some cases appeared seriously checked and 
badly flaking from over 10 years exposure to weather. All 
three dichroic plates were stripped of paint and the alu- 
minum alloy base metal was irridited. The irridite process 
is a thin film and considered sufficient weather protection 
for several years. It is unknown how much steady-state 
(dry) noise degradation was being experienced with the 
paint seriously flaking. The expectation is that at least 
following rain, the recovery time to normal system noise 
temperature will be considerably improved due to more 
rapid drying of the irridited noncapillary surfaces. 

In summary, one might expect the S-band zenith noise 
to remain roughly the same and expect a small X-band 
noise zenith decrease. X-band noise recovery following 
rainy conditions should improve. Table 5 summarizes 
available measured zenith noise temperatures for 64- and 
70-m antennas at S-and X-bands. From the measured val- 
ues, composite S- and X-band noise temperature models 
were created, as described in [3]. The table is divided 
into two parts: measured totals (including atmosphere) 
and those values inferred for no atmosphere. The no- 
atmosphere data is based on local meteorological condi- 
tions existing at the time of the measurements and should 
be the more reliable (weather-independent) data for com- 
parison purposes. 

From Table 5, the DSS 63/64-m S-band noise ap- 
pears abnormally high, and remains so after 70-m conver- 
sion. The 20.7 ± 0.6-K DSS-14/43 S-band 64-m average 
(with atmosphere) is reasonable. After 70-m conversion, 
the DSS-14/43 average (with atmosphere) is again reason- 
able, at 18.9 ± 0.6 K. The average improvement of 1.8 K 
is a distinct S-band performance benefit. However, the 
atmosphere-free data is the more reliable. Here, the av- 
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erage difference is 1.5 K, still indicating a distinct S-band 
70-m performance benefit. 

At X-band, the DSS-14/64-m noise was abnormally 
high. However, after 70-m conversion all three anten- 
nas provided quite similar zenith noise performance of 
20.9 ± 0.3 K. Ignoring the abnormal DSS-14 64- to 70- 
m difference, contaminated by the high 64-m data, and 
taking the atmosphere-free data as the more reliable, the 
average difference of 0.7 K (70-m improvement) again pro- 
vides another distinct 70-m performance benefit. 

It is noted that the DSS-43 70-m antenna has the best 
S-band zenith noise performance, at 18.3 K, while the oth- 
ers are considerably higher. In decibels, DSS-14 is 0.2 dB 
worse, while DSS-63 is fully 1 dB worse at S-band, clearly a 
candidate for maintenance attention to either operational 
or calibration equipment. 

It is emphasized that considerable noise data are re- 
quired for a fully reliable performance description. Se- 
lected 70-m data are presented here. Figure 5 shows 70- 
m S-band total system noise temperatures (including at- 
mosphere) taken during the calibration measurements at 
three different stations. The DSS-63 operational or instru- 
mentation anomaly is clear, and these data are ignored for 
system noise temperature modeling. The DSS-43 18.3-K 
zenith temperature and the elevation dependence should 
be achievable in principle, and probably in practice as well, 
by all stations. 

Figure 6 shows three selected 70-m X-band total sys- 
tem noise temperature data sets, again including atmo- 
sphere. At the time the DSS-14 data were obtained, the 
dichroic plate had not yet been reinstalled. Thus the 19.7- 
K measured total DSS-14 zenith noise shown in Fig. 6 
projects to 20.9 K in the normal operating configuration 
with the dichroic plate installed. Remarkably, the three 
very low noise antennas perform quite uniformly with a 
zenith noise temperature of 20.9 ± 0.3 K at X-band. Spe- 
cial efforts should be made to maintain this highly desir- 
able and valuable performance condition. 

From the limited data in Figs. 5 and 6, and the total 
system temperatures at zenith discussed above, the best 
present estimates of the DSN 70-m S- and X-band sys- 
tem noise temperatures are plotted in Fig. 7. The upper 
pair of plots are total system temperatures (with atmo- 
sphere) as functions of elevation angle. These are the op- 
erational noise temperatures (with slight adjustments) ex- 
perienced in the nominal clear weather which existed at 


the time of the calibration measurements. In the same 
figure, the lower pair of plots represents antenna perfor- 
mance alone, in what may be referred to as the “vac- 
uum” condition, that is, with the nominal clear atmo- 
sphere removed. From atmosphere-removed plots, factors 
such as spillover and quadripod or other scatter may be 
estimated for detailed design evaluation purposes. For ex- 
ample, at 30-deg elevation, the atmosphere-removed 17.5- 
K (S-band) measurement is only 0.9 K above the the zenith 
(vacuum) of 16.6 K. Similarly, again at 30-deg elevation, 
the atmosphere- removed 20.6-K measurement (X-band) is 
only 2.2 K above the zenith (vacuum) of 18.4 K. These 
small noise terms are indeed difficult to know with great 
confidence, especially with limited data, but appear re- 
markably small and indicate the 70-m design has provided 
excellent low noise antennas. 

In any careful future measurements and comparisons 
of antenna noise performance, it is recommended the feed- 
cone hoist cables and heavy large lifting sheaves and hooks 
be removed from the topmost two quadripod legs. These 
operationally needless (service) tools can only serve to 
degrade antenna noise performance, due to an unknown 
amount of ground noise pickup from additional scattering. 

It can be concluded that the reduced cross-section 
70-m quadripod, in the slightly deeper main reflector en- 
vironment, is the key agent leading to 70-m system noise 
improvements. At zenith, 1.5-K S-band improvement and 
0.7-K X-band improvement is observed. Observing noise 
as a function of elevation angle, one sees very small spill- 
over and scatter components. It is quite clear, based on 
gain and noise temperature measurements, that the efforts 
expended to improve the 70-m quadripod structural design 
have yielded multiple benefits to microwave performance, 
at both frequency bands, to the three major antenna sys- 
tems. 


IX. Conclusion 

The detailed design-expected area efficiencies for the 
high-efficiency (dual-shaped reflector) DSN 70-m antennas 
range from 74.9 to about 75.8 percent at S-band, and from 
68.8 to about 70.8 percent at X-band, based on the best 
estimating practices available. Measurements carried out 
during 1988 show 76.1 percent has been achieved at S- 
band, and an average of 68.6 percent achieved at X-band. 
The highest measured peak efficiency (DSS-63) is 70.1 per- 
cent, while the lowest measured efficiency is 67.3 percent 
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(at the X-band) where small-scale surface roughness effects 
are dominant. 

The 70-m upgrade project achieved all objectives and 
more. The noise performance of all three antennas was 
improved, and it is clear from the limited gain and noise 
temperature measurements that the reduced cross-section 
quadripod contributed significantly to the overall perfor- 
mance improvement. 

Additional X-band microwave performance is easily 
obtainable: the feedhorn design can be improved by 

0.1 dB; and main reflector panels can be reset to obtain 
an additional 0.1 dB. Both the feedhorn and surface ad- 
justment compromises were based on time and financial 
resource limitations. Nevertheless, those remain impor- 
tant, but recoverable, performance compromises. 


One question remains unanswered. The structural/ 
gravity (or focus) performance with elevation angle change 
is not as good as expected for two of the antennas (DSS- 
43 and DSS-63). Further work in this area will be accom- 
plished in a post-project phase, beginning with focus tests 
at high and low elevation angles, and the study of valu- 
able low-elevation holography images. These are expected 
to reveal actual structural gravity behavior for comparison 
with theoretical models. 

The DSN 70-m project was a significant undertak- 
ing, and by any reasonable standard, was severely time 
constrained to fit between the Voyager Uranus (1986) and 
Neptune (1989) planetary encounters. Good design, man- 
agement, contracting, scheduling, fieldwork, and technol- 
ogy formed a powerful combination. The use of microwave 
holography proved to be vital to the Project success. 
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Table 1. 8420-MHz sources for DSN antenna calibrations 


Source 

Flux, Jy 

Source Size/Beamwidth Correction, C(R) 

64-m 

70-m 

3C274 

Old: 46.00 

Old: 1.085° 
1.089 6 

Old: 1.106 


New: 45.20 

New: 1.15 

New: 1.18 


(+0.076 dB) 

(+0.253 dB)“ 
(+0.237 dB) 6 

(+0.281dB) 

Others 

(+0.076 dB) 

Varies according to 

source size 


Notes: 

Jy — Jansky = 10~ 26 W/m 2 /Hz 

a JPL Document D-3801, Rev. A (DSS-43/63, 64-m calibration) 
6 JPL Document D-3801, Rev. B (DSS-14, 64-m, and initial 
70-m calibrations) 


Table 2. Summary of design expected and average measured 
efficiencies, 64-m antennas 


64-m 

Antenna/Band 

Design-expected 

nominal 

efficiency 

Estimated 

minimum-likely 

efficiency 

Average 

measured 

efficiency 

All/S-Band 

0.603 

0.591 

0.582 

DSS-14/X-Band 

0.565 

0.541 

0.535 

DSS-43/X-Band 

0.565 

0.533 

0.490 

DSS-63/X-Band 

0.565 

0.533 

0.487 


Table 3. Summary of design-expected and average measured 
efficiencies, 70-m antennas 


70-m 

Design-expected 

Estimated 

Average 

Antenna/Band 

nominal 

minimum-likely 

measured 


efficiency 

efficiency 

efficiency 

All/S-band 

0.758 

0.749 

0.761 

All/X-band 

0.708 

0.686 

0.686 
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Table 4. Summary of measured S- and X-band gain and 
efficiency results 



DSS-14 

X-Band 

DSS-43 

X-Band 

DSS-63 

X-Band 

All Stations 
S-Band 

64- m 

Gain, dBi 

72.33 

71.94 

71.91 

61.40 

Efficiency 

0.535 

0.490 

0.487 

0.582 

70-m 

Gain, dbi 

74.17 

74.10 

74.28 

63.34 

Efficiency 

0.684 

0.673 

0.701 

0.761 

64-70-m 

Gain improvement, dB 

+ 1.84 

+2.16 

+2.37 

+ 1.94 


Table 5. Summary of zenith system noise 
temperatures for 64/70-m antennas 



S-Band 

X-Band 

Antenna/System 

Zenith noise, K 

Zenith noise, K 


With Atmosphere 

DSS-14/64-m 

21.2 

25.2* 

DSS-14/70-m 

19.5 (-1.7) 

20.9 (-4.3)* 

DSS-43/64-m 

20.1 

21.8 

DSS-43/70-m 

18.3 (-1.8) 

21.2 (-0.6) 

DSS-63/64-m 

27.7* 

22.0 

DSS-63/70-m 

23.0 (-4.7)* 

20.6 (-1.4) 


Without Atmosphere 

DSS-14/64-m 

19.1 

23.2* 

DSS-14/70-m 

17.7 (-1.4) 

18.7 (-4.5)* 

DSS-43/64-m 

18.2 

19.2 

DSS-43/70-m 

16.5 (-1.7) 

18.7 (-0.5) 

DSS-63/64-m 

25.6* 

19.0 

DSS-63/70-m 

21.2 (-4.4)* 

18.1 (-0.9) 

* Considered abnormal, and change not representative of 

design improvement. 
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Fig. 1. DSN 64-m antenna X-band gain performance (without atmosphere) 

plus design expectation. 
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Fig. 2. DSN 70-m antenna X-band gain performance (without atmosphere), 

plus design expectation. 
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Fig. 4. DSN 64/70-m antenna X-band gain comparison (without atmosphere) 

plus design expectation. 
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Fig. 6. DSN 70-m antenna X-band system noise temperatures (raw data, with 
atmosphere) (DSS-14: no dichroic reflector). 
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Kalman filters have been used for some time to process clock measurement 
data. Due to instabilities in the standard Kalman filter algorithms, the results 
have been unreliable and difficult to obtain. Often, in order to obtain reasonable 
results, the data has had to be manually edited, the filter fine tuned, or the model 
adjusted by the analyst. During the past several years, stable forms of the Kalman 
filter have been developed, implemented, and used in many diverse applications. 

These algorithms, while algebraically equivalent to the standard Kalman filter, ex- 
hibit excellent numerical properties. Two of these stable algorithms, the Upper 
triangular-Diagonal (UD) filter and the Square Root Information Filter (SRIF), 
have been implemented to replace the standard Kalman filter used to process data 
from the DSN hydrogen maser clocks. The data are time offsets between the clocks 
in the DSN, the timescale at the National Institute of Standards and Technology 
(NIST), and two geographically intermediate clocks. The measurements are made 
by using the GPS navigation satellites in mutual view between clocks. The filter 
programs allow the user to easily modify the clock models, the GPS satellite de- 
pendent biases, and the random noise levels in order to compare different modeling 
assumptions. 

The results of this study show the usefulness of such software for processing 
clock data. The UD filter is indeed a stable, efficient, and flexible method for 
obtaining optimal estimates of clock offsets , offset rates, and drift rates. A brief 
overview of the UD filter is also given. 

I. Introduction ordinated Universal Time (UTC) as realized at the Na- 

tional Institute of Standards and Technology (NIST), with 
The requirements of the DSN-complex clocks are that a knowledge of 1 microsecond and that the rate of each 
each clock be maintained within 10 microseconds of Co- clock be kept within ±IE - 12 df/f with a knowledge of 
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±3E — 13 df/f, The quantity df/f is known as the frac- 
tional frequency deviation where 


df/f = 


and f m is the measured frequency and /o is the reference 
frequency. Also there is a requirement that a permanent 
record be kept of the synchronization and syntonization 
of the DSN-complex clocks. In order to meet these re- 
quirements, the time and frequency offset to UTC (NIST) 
is monitored using the Global Positioning System (GPS) 
navigation space vehicles. The goal (not yet realized) is 
to adjust the rates of all the clocks at the same time at 
approximately 200-day intervals. The clock rates (frequen- 
cies) are adjusted when they are above UTC (NBS) by a 
certain amount to below, and then allowed to drift through 
zero rate offset from UTC (NIST) until they are again high 
enough to require adjustment. This technique, done with 
some care, allows one to keep the clocks within the re- 
quired time and frequency offset limits and still not have 
any jumps in the time offset. All of the drift rates of 
the hydrogen masers are about the same, therefore, the 
complex clocks can be kept in close synchronization and 
syntonization. 

Monitoring of the DSN-complex clocks is done by the 
operations analyst at JPL. Adjustments of the clocks is 
recommended by the operations analyst with the advice 
of the engineering staff. The adjustment is scheduled af- 
ter a check of the users of the DSN and is made during 
a maintenance period at the complex. The actual adjust- 
ment is made by the engineering staff at the complex. It 
is assumed that the drift of the hydrogen maser rate is 
due to cavity drift, therefore, rate adjustments are made 
by adjusting the varactor, thereby changing the apparent 
cavity size of the hydrogen maser. 

At critical times during some projects (such as the 
Voyager Uranus encounter), it is desirable to have the 
clocks synchronized and syntonized more closely than the 
above specifications. When this is the situation, the oper- 
ations analyst will predict the time and rate of the DSN 
clocks 30 to 90 days prior to the encounter date and set 
the rate to allow the clocks to drift to the near zero time 
and rate offset by the critical time. At the time of the 
Voyager Uranus encounter, JPL engineering staff used a 
Kalman filter to predict the clock offset 60 days into the 
future with excellent results [3]. 


At JPL, a Kalman filter has been used to process 
clock offset measurements but it has never been used oper- 
ationally. The present operational environment for clock 
management in the DSN is a personal computer with a 
considerable amount of manual intervention. The data is 
handled in weekly batches. By 1992 the process is to be 
transferred to the Network Frequency and Timing subsys- 
tem (NFT), which is a part of the DSN Frequency and 
Timing system (DFT). The flow of data and estimate up- 
dates will occur more often (perhaps as often as once per 
hour) so as to allow operations to be able to monitor clock 
performance in near real time. This will require a fairly 
robust processing environment with automatic elimination 
of outliers and other techniques to make the process as 
autonomous as possible. It is expected that operations- 
analyst intervention will be relegated to filter adjustments 
in response to scheduled changes of time or rate in any of 
the clocks being entered into the filter parameters. 

The operations analysts will have the opportunity to 
do postanalysis of the data. In the near future, it is ex- 
pected that some data will continue to be available on a 
weekly basis and that better ephemerides will be available 
several weeks after the fact. At this time, use of the fil- 
ter/smoother by the operations analyst can produce better 
estimates of the clock parameters for archiving. To achieve 
these goals, a robust filter and smoother program is under 
development at JPL. The results reported in this article 
were obtained by running this program with data as de- 
scribed in the next section. No “fine tuning” adjustments 
or manual data editing were performed. The output, how- 
ever, was excellent. The filter used in the JPL program 
is the numerically stable Upper triangular-Diagonal (UD) 
form of the Kalman filter. The UD filter has been under 
development for several years [1] and has earned a repu- 
tation for being accurate and efficient. The more tradi- 
tional Kalman formulation has been known for some time 
to be numerically unstable. That is, due to rounding errors 
within the computer, the results produced by the filter pro- 
gram are sometimes completely wrong. The smoother used 
in this code is a recent improvement [2] of the Rauch-Tung- 
Streibel (RTS) smoother [5]. The revised RTS smoother 
is also robust and accurate. 

The filter and smoother programs make use of subrou- 
tines from the Estimation Subroutine Library (ESL). This 
is a collection of FORTRAN77 subroutines for construct- 
ing filters and smoothers. The routines in the ESL have 
been carefully coded and tested to ensure their accuracy 
and reliability. Overall, the goal is to produce optimal es- 
timates of clock parameters without manual intervention. 
The results of this first attempt are extremely encouraging. 
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II. Clock Measurements 


There is an NIST-designed GPS timing receiver lo- 
cated at each complex. A timing pulse from the complex 
master clock is fed to each receiver so that the data output 
from the receiver is the offset of the complex master clock 
to GPS timescale. The receivers are maintained on the mu- 
tual view schedule, which is generated by the Bureau Inter- 
national des Poids et Measures (BIPM) in Paris, France. 
These data are kept in a database along with data from 
UTC (RRL) and UTC (TAO) in Japan, UTC (NIST), 
and the NIST cesium clock at WWVH Hawaii, U. S. The 
data from Japan are obtained from the GE MKIII cata- 
logue, which is administered worldwide by the BIPM and 
inside the U. S. by the United States Naval Observatory 
(USNO). Each of the data lines is tagged to indicate the 
receiver from which it came and then placed in a file and 
stored on a disk in a personal computer. There is one file 
for each day, and it is named by Modified Julian Day. 1 

The first process is to take the first difference of the 
data to obtain the differences between the clocks on the 
ground, thus eliminating the space vehicle clocks. There 
are six clocks used; because of the geometry of the clock 
locations there are 11 possible mutual views available 
(Fig. 1). A program is used to produce a file which con- 
tains as entries the station pair, the satellite number, the 
MJD, the time (second) the data was collected, and the 
difference in time in tenths of nanoseconds between the 
first and second clock. (The above format could easily be 
adapted to include clock difference data from other sources 
such as two-way time coordination.) Data was generated 
by this program from MJD 47000 to 47132. There were 
approximately 4000 measurements in the data set. 


d{(t): the drift rate, that is, the second derivative of 
the offset, d, — d 2 Si/dt 2 . 

&*w(0 : the measurement biases. These depend on the 
two stations, i and it, and on the satellite, /, being ob- 
served. These biases will be explained in more detail 
• later. 

The dynamics model describes how the above quan- 
tities change over time. This is expressed as follows: 


S|(0 = r *(<) 

(l) 

r,(0 = M 0 

(2) 

•to 

tl 

*"13 

(3) 


where • denotes time differentiation, and tu,*($) is white 
noise, with zero mean and known power spectral density 
(psd) W . That is, 


E 


i^(f)u;i(r)J = W6(t — r) 


(4) 


where E [ •] is the expected value operation, and where 6 is 
the Dirac delta function. Note that Eqs. (1) and (2) are 
just restatements of the definitions of r, and d,-. Equa- 
tion (3) says that the rates are “approximately” constant. 

In order to apply a discrete filter to this problem, 
Eqs. (1-3) must be put into the form of a discrete dynamics 
equation 


III. Mathematical Model 

In order to describe the mathematical model used for 
this analysis, the following notation is introduced: 

Si(t): the clock offset at station i. That is, $,• = c,* — T, 
where c, is the clock reading at station i and T is the 
“true” time. For this analysis, “true” time is taken to 
be the time given by UTC (NIST). 

ri(t): the clock offset rate, also called the frequency 
error. This is just the time derivative of the offset, 
i.e., r< = dsi/dt. 

1 The Modified Julian Day (MJD) is a continuous day count 
with an initial epoch of 0000 hours UT on November 17, 1858. 


x(t + At) = $(t, At)x(t) + tu(t) (5) 

where x denotes a vector with components that include 
s,-, **«, di for each station. Note that all of these quanti- 
ties are time-dependent; however, to simplify the notation, 
this dependence will often be suppressed. The matrix 4>, 
called the transition matrix, is determined by writing out 
a discrete form of Eqs. (1-3). A simple way to do this, 
for example, at least for the offsets and rates, is to use a 
discrete form of Taylor’s Theorem: 

Si(t + A0 = Si(t) + n(t)At + 0M(t)At 2 (6) 

r,(t + AO = r,(0 + di(t)At (7) 
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For the drift rates, this same approach produces 

/■A< 

di(t + At) = d{(i) + J Wi(r)dT 


( 8 ) 


where Wi(t) represents the random noise. However, the 
effects of the noise on the drift rates are shown directly 
in Eq. (8), but this same noise also affects the offsets and 
frequency errors because of the presence of d{(t) in Eqs. (6) 
and (7). As written above, however, there will be a delay, 
caused by the time discretization, before the noise at time 
t propagates into the offset and rate. To avoid this delay, 
noise terms are included directly in the equations for Si 
and r,-. This is done by using the continuous form of the 
transition matrix to propagate the noise over the interval 
t to t -f A* . That is, if the noise that enters the continuous 
system at time t is given by the vector 


w(t) = (o,0,uv(0) 

where u; r (f) is white noise with zero mean and psd W r} 
then the integrated effect of the noise at time t + At will 
be the vector 

q(t + A<) = ( q„q r ,qd) T 

rp 

where ( q ,, q r ,q t ) ~ N(0,Q) and 
Q = E $ (At — t) w(r)dr J w T (<r)$ T (At — a)do 

= E [fL 


$(r)w(At — t)w j (At — a)# 1 (a)d<rdT 


= (0.5r 2 , r, l) T W r (0.5r 2 , r, l)dr 


( 9 ) 


since 


*(<) = 


1 t 0.5 t 2 
0 1 t 

0 0 1 


and E[wJ w; r ] = W r . The integration in Eq. (9) gives 


Q=w r 


r^A< 5 §A t 4 I At 3 
|A t 4 |A t 3 \At 2 


L t 3 2 At 2 


At 


( 10 ) 


Thus the discrete dynamics model for these filter states is 


where 


/ Si(t + At)) 


/«.(<) \ 

r,(< + A<) 

= 

»•.'(<) 

\ d{(t + At) J 


\di(t)J 


'1 At 

*A< 2 ‘ 

= 

0 1 

At 


.0 0 

1 


+ u>i 


(ii) 


( 12 ) 


Wi ~ N(0 , Q t ), and Qi is given by Eq. (10). There will be a 
transition, of the form of Eq. (11), for each station. Thus, 
the full transition matrix $ in Eq. (5) will have blocks of 
the form of Eq. (12) on its diagonal. The process noise 
term, w(t) i in Eq. (5) will have as its covariance matrix a 
block diagonal matrix, with 3x3 matrices of the form of 
Eq. (10) on the diagonal. 

The measurements to be used for estimating the off- 
sets, drifts, and drift rates are the differences in clock val- 
ues from the GPS times, as measured at two different sta- 
tions. That is, the measurements are 


z ikl (f j ) 


which are assumed to be of the form 


*.«(<;) = (ci(tj) - GPS,) - - GPS,) 

+ &»*/(*>) + Vik(tj) 

= (f J ) ” c k(tj) bikl(tj) H" v ik(tj) (13) 


where, as defined earlier, c,(fj) is the clock reading at sta- 
tion i at time tj. The fact that s;(i,) = c,(tfj) — NIST(tj) 
allows one also to write this as 
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Zik,(tj) = - NIST(t,)) - (c k (tj) - NIST(tj ) ) 

+ bikl(tj ) + v ik(tj) 

= s k(tj} "I" 

The first expression is used to obtain the measurement 
data as described in Section II. That is, the data actu- 
ally used for the analysis are values for c,(< ; ) — GPSj(tj). 
However, for the filter/smoother, the measurements are as- 
sumed to be of the second form, where the biases, 6,*/(tj), 
satisfy the trivial dynamics equation 

b ikl (t + At) = b ikl (t) (14) 

and the measurement noise, Vi k (tj), is, as usual, a nor- 
mally distributed zero mean random variable with con- 
stant variance r. The biases are appended to the “core 
states” to obtain the actual state vector used in the filter: 

x = (si, ri, di, S2, r2, c?2, ...» Sm, r m ,d m , 6ni, 6112, 6113, . . .) 

The transition matrix will have an identity matrix at- 
tached to it to reflect Eq. (14). Then the measurements 
can be written in standard form as 

Zj = HjXj + Vj (15) 

where Hj is a 1 x n measurement matrix, consisting, in 
this case, of two positive ones and a negative one, in ap- 
propriate positions. For example, a measurement based 
on observations from stations 1 and 3, and satellite 6, will 
have a measurement matrix 


H = (l i 0,0,0, 0,0, -1,0,0,. ..,0,1,0,...) 

where the final one appears in the position corresponding 
to bias 6 x 36 . 


IV. The UD Form of the Kalman Filter 

The Kalman Filter has become the standard tool for 
computing optimal estimates of a state vector, x(t), gov- 
erned by a mathematical model of the form of Eqs. (5) and 
(15). To describe the form of the filter that is of interest 
here, the following notation will be used: 


x k \j: estimate of x(t k ) using data up to time tj 

P k \j : covariance matrix for the error in x k y as an ap- 
proximation to x(tk) 

The Kalman Filter time update, also called the prediction 
step is 


£jbl*-l — ®kxk-l\k-l 

(16) 

Pk\k-i — $kPk-i\k-i$I + Q* 

(17) 


The measurement update is accomplished via the following 
set of equations; 


Bk — H k Pk\k-\Hk + Rk 


(18) 

* 

II 

>p 

1 

to 
*■ 1 

(Kalman gain) 

(19) 

P k \ k = {I-K h H h )P k 


( 20 ) 

Vk = Zk — HkXk\k-\ 

(innovations) 

( 21 ) 

£jb|jfc = £fc|ib-l + Kk^k 


( 22 ) 


The numerical instability of the Kalman equations, as 
noted in Section I, is due to the structure of these equa- 
tions. The most common indication that something has 
gone wrong is when rounding error causes P k \ k to be not 
positive definite. Note that it is not clear from these equa- 
tions that P k is even symmetric. 

The UD form of the Kalman Filter is algebraically 
equivalent to the above equations, but has much better nu- 
merical properties. It is based on the fact that any positive 
definite matrix P can be written as a product of matrices 

P = UDU t (23) 

where U is an upper triangular matrix with ones on the di- 
agonal, and D is a diagonal matrix, with positive diagonal 
elements. The idea behind the UD formulation is to start 
with the UD factors of P 0 1 0 , then compute the factors of 
all the remaining covariances, without ever computing the 
covariances themselves. It can be shown that by avoiding 
the covariances, the accuracy of the results is greatly im- 
proved. On the other hand, if the covariances are needed 
for output they can be easily obtained by simply multiply- 
ing the factors together. 
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To describe the time update in UD form, assume that 
the factors of P k j* are known from the previous step. That 
is, JJ k \ k and D k \ k have already been computed, where 

Pk\k = U kik D klk U^ k (24) 

Then x k+l \ k is computed just as in the Kalman formula- 
tion; but the equation for P k +i\ k becomes 

R fc+l|Jb = $kPk\k$7c + Qk 

= $kU k \kD k \ k U k \ k <bJ + Q k (25) 

Since $ k is upper triangular, so is the product $kUk\k* 
Hence, if Q k were zero, then the factors of P*+i|jb would 
be just 

RkD k R k 

where 

Rk = *uU klk (26) 

In order to account for the process noise, Q k is factored 

into its UD factors 

Qk = UqDqUj 

which can also be written as 

Qk = Y,d iqi qJ (27) 

1 = 1 

where qi is the ith column of U q . Then Eq. (25), using 
Eqs. (26) and (27) becomes 

n 

Rk+I\k = RkDkRk + y ^diqiqj (28) 

»=1 

Now the time update step is completed by finding the UD 
factors of P Jk _|_ 1 |j fc . This is done by a series of “rank one” 
adjustments to previous factors. That is, consider the gen- 
eral problem: given factors i?, jD, compute the factors of 
the matrix 

RDR T + dqq T 

where d is a positive scalar, and q is a vector. 


The matrix dqq T is a matrix of rank one. Hence dqq T 
is called a “rank one” adjustment to RDR T . There is a 
very accurate and efficient algorithm for computing the 
factors of this new matrix, given the factors R , D , and the 
quantities d and q. It is important here that the factors 
are not multiplied together, as this would destroy the nu- 
merical accuracy. This rank-one update algorithm, called 
the Turner- Agee Algorithm [1], is applied repeatedly to 
Eq. (28) to complete the time update step of the filter. 

Next, consider the measurement update, as given in 
the Kalman form by Eqs. (18-22). The Kalman equation 
can be used to process vector measurements. However, the 
UD form is more efficient if the measurements are scalars. 
In this case, in the measurement update equations, B k is 
a scalar, as is v ki and K is a vector. The measurement 
update is done several times, once for each scalar mea- 
surement for each time update. After the previous time 
update, the factors U k \ k -i and D k \ k _\ of P k \k-i are avail- 
able. The (scalar) B k as defined by Eq. (18) is computed 
as follows: 

B k = H k U k \ k ^iD k \ k ^iU k ^ k ^ 1 H k + Rk 
= yl^k\k-iyk + Rk 

where 

y* = 

is a vector. Next, Eqs. (19) and (20) are written as 
P k]k = (I-I< k H k )P k \ k _ x 

= - j^Rk\k-\Hk HkPk\k-x 

- U k \ k -\D k \ k .iU^ k _ l 

- ~^Uk\k~i^k\k-\U k \k~\Rk H k U k \ k ^\D k ^ k ^iU k ^ k _ 1 

= u k \k-x [am*— i - j- k vyT ] ^*-1 ( 20 ) 

where V is the vector defined by 

V = D k \k-xUlk-xHk 
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All that remains is to compute the factors U v , D v of 

Dk\k-i ~ j-VV T (30) 

Then it follows that 

Pk\k = U k \ k - 1 U v D v UZU? lk _ 1 (31) 

^*|k ^k|k uT.. 


Notice that Eq. 30 is a rank-one modification of a diagonal 
matrix. Hence its factors can be computed accurately and 
efficiently, just as was done in the time update. 

While this UD formulation may seem more compli- 
cated than the relatively simple matrix equations of the 
Kalman form (Eqs. 16-22), a computer program to imple- 
ment the UD filter is very straightforward. Two special 
subroutines are needed: 

(1) A subroutine to efficiently compute the product of two 
upper triangular matrices, such as 

(2) A subroutine to adjust the factors of a matrix by 
a rank-one addition: given factors £/, D } a positive 
scalar c, and a vector y, compute the factors of U DU T 
+ cyy T 

Then the time update is done by the following steps: 

(1) Compute R = $kU (sometimes called the determin- 
istic update) 

(2) For each process noise term d^qj, update the factors 
using the rank-one adjustment routine 

The measurement update is also broken down into a series 
of simple steps, one for each scalar measurement: 

(1) Compute the scalar B = y 7 ^ Dy + r where y = U T H T 

(2) Use the rank-one adjustment routine to compute the 
factors of D - £ VV T where V = DU T H T 

(3) Combine these factors with the previous factors, ac- 
cording to Eq. 31 


V. Filter Implementation 

In addition to the clock measurement data, as de- 
scribed in Section II, the filter routine must be given the 
following information: 

(1) Prior estimates of the state vector 

(2) Uncertainties associated with the state estimates 

(3) Process-noise standard deviations 

(4) Measurement-noise standard deviations 

Generally, the filter is somewhat insensitive to prior 
estimates of the state; hence, they are usually initialized at 
zero. The uncertainties must be large enough so that the 
filter will accept the measurement data and use it to adjust 
the state estimates, except that the offsets for the NIST 
station should remain close to zero. The process noise 
can depend on the station; however, for the results of Sec- 
tion VI, all stations were given the same amount of noise. 
Similarly, the measurement noise can be made a function 
of the stations and the space vehicle. For simplicity during 
this first analysis, however, the measurement noise uncer- 
tainty was assumed to be the same for all measurements. 

In order to run the smoother, certain information 
must be saved at each measurement update time. Thus, 
to limit the amount of storage required to run the filter 
and smoother, the program was set up to process mea- 
surements in so-called “mini-batches.” Each mini-batch 
consists of all measurements obtained during a fixed time- 
span, say one day, or a half a day. The midpoint of this 
timespan is chosen as the time of the update (also called 
the “epoch time”). The measurements within the mini- 
batch are translated to the epoch time by using the same 
transition matrix, $(ti,f 2 ), as is used in the dynamics 
equation. More precisely, suppose that the actual mea- 
surement is 

z(ti) = Hx(ti) + tv (32) 

where t\ denotes the time at which the measurement was 
taken. Let 1 2 be the epoch time for this mini-batch. Then, 
if it is assumed that the timespan for this mini-batch is 
sufficiently short, so that the integrated effect of the noise 
is negligible, one can write 

*(<i) = $(<2 ,<i)x(* 2 ) (33) 

so that Eq. (32) becomes 

z(*i) = H$(t 2 ,ti)x(t 2 ) + w (34) 
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That is, the measurement z(t\) becomes a measure- 
ment of the state at the epoch time, but with the measure- 
ment matrix given by H$ instead of just H. It is impor- 
tant to realize that this use of mini-batches can change the 
filter results because it assumes that the effect of the pro- 
cess noise over the timespan of the mini-batch is negligible. 
The code that was developed allows the user to select one 
or more mini-batches per day, or to process each measure- 
ment as it is received. 

The filter/smoother was implemented in FORTRAN- 
77, making use of routines from the Estimation Subroutine 
Library (ESL) to do most of the numerical computations. 
The UD form of the Kalman filter, as described in Sec- 
tion IV was used for the filter; the smoother made use 
of Bierman’s modification of the RTS smoother [2]. The 
general structure of the program is as follows: 

(1) Read a namelist file to initialize the filter: 

Start and stop times 

Maximum number of measurements to process 
Initial error covariances and state estimate 

(2) For each measurement: 

If new minibatch, 

(a) Do time update 

(b) Print estimates and estimate error standard de- 
viations 

(c) Translate measurements to epoch time and do 
the measurement update; write the smoother 
gains to a file for later use 

(3) After all measurements have been processed, print out 
the correlation coefficients 

(4) If the smoother has been requested, run the smoother 
back to the start time 

The results of Section VI were obtained by running 
the program with 6 stations, 11 station pairs, and 6 satel- 
lites. Hence the filter state vector has 84 components. An 
important feature of the filter implementation is the data 
editing capability. Before each measurement is actually 
included into the estimate, a test of the measurements 
consistency with past measurements is performed. This 
is done as follows. First the residual (or innovations) as 
defined by Eq. (21) is computed, as is the variance of this 
residual which is just the quantity Bk defined by Eq. (18). 


If the square of the residual divided by its variance is less 
than a user-specified tolerance, then the measurement is 
accepted and used to update the current estimates. For 
the results described in Section VI, a tolerance of 400 was 
used. This means that a measurement was rejected only 
if its residual was more than 20 standard deviations away 
from its mean value of zero. 

VI. Numerical Results 

The numerical results described in this section were 
obtained by running the filter and smoother on the data 
described in Section II. Approximately 10 data points were 
rejected by the data editor that is built into the filter. Fig- 
ure 2 gives an overview of the results of the smoother. This 
figure shows the offsets from UTC (NIST) of the three 
clocks in the DSN. This is a useful display for the oper- 
ations analyst. It can be used to tell when adjustments 
to the rates are in order. Notice that a rate adjustment 
would have been appropriate around MJD 47080; however 
an adjustment was not made until MJD 47130. The Allan 
variances of the smoother results, shown in Fig. 3, are con- 
sistent with those reported in [4], but are somewhat better. 
It is felt that these variances can be further improved with 
better modeling. 

In order to contrast the output from the smoother 
with the input to the filter, see Fig. 4(a). This shows 
the raw data for the California clock, with the smoother 
results superimposed. A rate of 1.2 E — 13 was removed 
from both plots. The smoother nicely bridges a gap in 
the data at about MJD 47050. During this gap, the stan- 
dard deviations computed by the smoother increased from 
about 4 nanoseconds, on MJD 47046, to 40 nanoseconds 
on MJD 47052, then dropped back to 4.4 nanoseconds on 
MJD 47053. Other data gaps were also handled well. This 
ability of the filter /smoot her to bridge data gaps will be 
important for operational uses. Figure 4(b) shows a super- 
position of the results produced by the JPL smoother and 
the NIST filtered estimates. Again, the 1.2E—13 rate has 
been removed from both results. The agreement is quite 
good, with less than 8 nanoseconds difference. The con- 
sistent bias, with the JPL results slightly lower, may be 
a result of the way in which the space vehicle biases are 
handled. 

The offset rate of the complex clocks is the most im- 
portant parameter from the operational point of view. 
Presently, the rate is determined by taking the mean of all 
the GPS measurements for ten days, then doing a least- 
squares linear fit and using the slope of that fit as an esti- 
mate of the rate. 
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The results of these “hand calculations” are shown as 
dots in Fig. 5. Generally, there is good agreement between 
these results and those produced by the filter/smoother. A 
careful examination of the times at which the hand calcu- 
lations differ noticeably from the smoother results revealed 
two reasons for these discrepancies. For example, in the 
vicinity of MJD 47060-47070, there was very little data 
for the clock in Spain. This caused the smoother to re- 
port a large uncertainty in its rates over this interval, and 
explains the difference between the smoother output and 
the hand calculation shown in Fig 5(c). The point ob- 
tained by the hand calculations for the clock in Australia 
on MJD 47070 was found, on closer examination, to have 
been calculated incorrectly. The correctly calculated value 
is very close to that produced by the smoother. 

VII. Conclusions and Future Work 

The UD form of the Kalman Filter appears to solve 
many of the problems encountered in the past with pro- 
cessing clock data on a routine basis. The numerical stabil- 
ity of the filter/smoother provides an accurate and reliable 
method for computing optimal estimates of the offsets, off- 
set rates, and drift rates. The automatic data editing fea- 
ture removes bad data points appropriately. 


The first several days worth of the data that were used 
for this study seemed to be very poor. For this reason, 
data corresponding to the first four days were skipped. 
As a comparison, this same data was also processed by 
using a Square Root Information Filter (SRIF). The SRIF 
is an even more numerically reliable form of the Kalman 
Filter. This increased reliability, however, is attained at 
the cost of an increase in computational time. The SRIF 
was able to handle the bad data at the beginning of the 
data set with no problems. Since computer run-time is of 
little concern for this application, it may be advisable to 
replace the UD filter with a SRIF. 

Possible improvements to the mathematical model 
used in the filter would be of interest. For example, it 
is clear that the assumption about the biases being con- 
stant is incorrect. An improved model would allow the 
biases to be time varying. The rate at which they should 
vary, however, would have to be determined by statistical 
parameter estimation techniques, such as maximum likeli- 
hood estimation. Also, in the current model, all clocks are 
treated as if they were identical. In fact there are three 
different types of clocks in the DSN. Special characteristics 
of the clocks could be taken into account to improve the 
accuracy of the model. 
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Fig. 5. Kalman smoother output and hand calculation of 
rate using a linear least-squares fit over 10 days of the 
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A Simple Model for DSS-14 Outage Times 

H. C. Rumsey , 1 R. Stevens, and E. C. Posner 
Office of Telecommunications and Data Acquisition 


A model is proposed to describe DSS-14 outage times. Discrepancy Reporting 
System outage data for the period from January 1986 through September 1988 are 
used to estimate the parameters of the model. The model provides a probability 
distribution for the duration of outages , which agrees well with observed data. 
The model depends only on a small number of parameters , and has some heuristic 
justification. This shows that the Discrepancy Reporting System in the DSN can 
be used to estimate the probability of extended outages in spite of the discrepancy 
reports ending when the pass ends. The probability of an outage extending beyond 


the end of a pass is estimated as around 

I. Introduction 

A model is proposed to describe DSS-14 outage times. 
Outage data for the period from January 1986 through 
September 1988 are used to estimate the parameters of the 
model. The model provides a probability distribution for 
the duration of the outages. The model does not address 
questions about the mean time between outages. However, 
it does allow estimation of the probability of major outages 
even though the Discrepancy Reporting (DR) system stops 
recording them at the end of a pass. The philosophy has 
been that the best model is the simplest one that fits well 
enough. 

The nature of the model, as above, is affected by the 
way outages are reported. If an extended outage occurs, 
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percent. 

the time to restore service is not reported. Only the time 
lost for that pass is recorded. This limitation makes it 
difficult to determine the actual outage durations from the 
DR data and must be accounted for in the model. In the 
model, it is assumed that the actual time to restore service 
is hidden by a “cutoff” process which corresponds to the 
end of the pass. A typical pass runs for 9 hours. This 
means that the actual time to restore service is masked 
by a 9-hour cutoff window. (In developing the model, 8-, 
9-, and 10-hour cutoffs were tried, with 9 fitting best and 
being reasonable on other considerations.) 


II. Outage Distribution 

Let R(t) be the distribution function for reported out- 
age durations, and let A(t) be the distribution function for 
actual outage durations. Assuming the “cutoff” process 
and outage durations are independent, then for t > 0, 
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Pr(reported time > t) 

= Pr(actual time > <)Pr(cutoff time > t) 
or 

l-R(t) = (l-A(tm-t/U) + (1) 

where U is the duration of the pass (nominally 9 hours) 
and (1 — </[/)+ is 0 for t > U. Equation (1) assumes that 
the start time of an outage is uniformly distributed over 
the duration of the pass. 

Equation (1) deals with the “cutoff” problem but says 
nothing about the actual distribution of outages A(<). The 
model for A(t) is based on the reported outage data and a 
desire to minimize the number of parameters in the model. 
Figure 1 shows this measured outage data from the DR 
system, accurate to one minute. Thus, some minutes show 
multiple outages. There are 498 outages presented in this 
figure. The mean time to restore service is about 40 min- 
utes. 

The simplest model to fit such data would be an ex- 
ponential distribution. The distribution function of an ex- 
ponential random variable X of a mean a is given by 

F(t) = Pr(X <<) = !- e ~ ,,a 

The fit is not accurate for short outages because Fig. 1 
shows the density goes to 0 at 0- length outages, but an 
exponential has maximum density at 0-length outages. An 
exponential random variable with mean around 40 minutes 
fits the first part of the outage data fairly well beyond a few 
minutes up to about 100 minutes, but there are too many 
extreme values (t > 100 minutes) in the outage data. This 
suggests that there are two (or more) classes of failures, in 
addition to the short failures. For the first class of long 
failures, service can be restored quickly, in less than 40 
minutes on the average. The second class of long failure 
requires more time to overcome. 

III. Model 

The following model has been adopted, with three pa- 
rameter a, a, and b to be estimated, since U is 9 hours and 
not estimated: 


Pr(reported time >t) = l — R(t) 

= ((l-o)(l + t/a)e-‘/° 

+ a(l + </6)e-*/*)(l - t/U)+ 

( 2 ) 

The form for this tail distribution has been taken to make 
the density function (essentially) zero at t = 0. Observe 
that if T a is a random variable with distribution function 
1 — (1 -M/a)e~*/°, then the expected value of T a is 2 a while 
the maximum of the density function of T a occurs at t = a. 

The parameters a and b occur symmetrically in 
Eq. (2). If a is chosen to be the smaller value, then out- 
ages of class “a” can be arranged to peak around t = 10 
minutes. Outages of class “6” can be arranged to fit the 
tails of the observed outages. More heuristics appear in 
Section VI. 

IV. Parameter Estimation 

Equation (2) defines the model. To complete the 
model, good values for the parameters {/, a, a, and b must 
be found. The maximum likelihood method will be used to 
estimate the parameters. Let (t\, . . . >t n ) be the reported 
outage times. The ts are reported by the DR system to 
the nearest minute. Let t be one of the outage times. The 
models specify the probability p(t) that an outage has du- 
ration f, where t is measured in minutes. Namely, 

p(t) = R(t + 1/2) - R(t - 1/2) 

where R is determined by Eq. (2). The probability that 
the observed outages occurred is the product of the prob- 
abilities of the separate outages, 

n p(<i > 

j = i 

This product is the likelihood function of the observations. 
It is a function of the model parameters {/, a, a, and 6. 
Maximum likelihood says to choose these parameters to 
maximize this product. The maximization is easy in this 
case because there are so few parameters. 
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There is a minor problem in determining the best 
value for the parameter U . Recall that U is the cutoff time 
for the pass. Finding U is like finding the end point of an 
interval (0, U) in which a uniform random variable occurs. 
A little thought shows that the likelihood function for such 
a problem is maximized by taking U to be as small as pos- 
sible. In the case of interest here, this would correspond 
to taking U to be slightly less than 8 hours (the maximum 
reported outage is 462 minutes). If extreme outages were 
common, this “defect” in the maximum-likelihood method 
would be no problem. Yet, the number of extended out- 
ages is small. So rather than estimate U from the likeli- 
hood function, U has been taken to be 9 hours throughout. 
To see how this selection affects the results, U = 8 and 10 
hours for the case Fj, = 1 were also tried. As expected, 
U = 8 hours gives a larger value for the likelihood func- 
tion, but the other parameters a and a are hardly changed. 
The results for U = 10 hours were not as good. Only 
U — 9 hours is considered below. This is consistent with 
the known distribution of pass lengths. 


V. Goodness of Fit 

A grid of points was used to find the maximum- 
likelihood values for the parameters. The maximum- 
likelihood values found were 

a = 0.186 
a = 11.4 minutes 
b — 77.5 minutes 

The corresponding distribution function is then given by 


played in Fig. 2. Qualitatively, the model fits the observed 
outages very well, for short, medium, and long outages. 

The mean time MTR to restore service for the model, 
is given by 

MTR = (1 - a)aT(U/a) + abT(U/b ) 

where 

T(x) = 2-3/x + (l + 3/x)e" x 

Substituting for a, a, 6, and U = 9 hours (540 minutes) as 
always gives 

MTR = 40.6 minutes 

This is in excellent agreement with the observed MTR of 
40.4 minutes. 

The probability that an outage exceeds 150 minutes 
was computed. With 498 outages, the model predicts there 
should be 28.4 outages of duration 150 minutes or longer. 
In the actual data of 498 outages there were 24 outages 
of this duration. Since the extended outage statistics are 
expected to be Poisson with an estimated mean of 28.4 and 
thus a sigma of \/28.4 = 5.33, the discrepancy of 4.4 is less 
than one sigma. This fits as well as could be expected 
with only 24 events. The probability of short and medium 
outages is also seen to fit very well. Hence, the use of the 
model seems indicated. 


Pr(reported outage > t) = 


VI. Heuristics 


[o.814(l + t/11.4)e-‘/ 114 


+0.186(1 + </77.5)e"‘ /77 5 


(1 — f/540)+ 


This equation represents the model. To see how well 
this model fits the observed outage data, the outage data 
was smoothed with a 5-point smoothing filter. The same 
filter was applied to the model as well. The results are dis- 


It can be expected that the density of outages near 
zero outage time is very nearly 0, because it takes some 
minimum time to notice an outage and to respond to it, 
even by switching in a hot standby automatically. The 
(1 -M/u)e _ */ a term in Eq. (2) does just this — it has den- 
sity 0 at 0, for the corresponding density is (t/a 2 )e“^°. 
This density also covers intermediate outages, but so does 
the “ft” class. One might think that yet another distribu- 
tion should be mixed in to cover these intermediate outages 
that cannot be recovered merely by switching something 
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in. But as has been seen, adding an extra one or two pa- 
rameters is not necessary — the fit is very good with just 
the single time parameter a, the location of the maximum 
density of the short and medium outages, and the addi- 
tional parameter a which gives the relative fraction of long 
outages. 

The long outages are described by a similar distribu- 
tion (1 + f/6)e~*^ with b a. Why the tail of this and 
the shorter-outage distribution should be exponential is 
less clear, but the fit is good, and hard to tell from a dis- 
tribution with a long low constant tail given the amount 
of data. It can be observed that the form of the a and 
b distribution arises as a difference of pure exponentials 
with infinitesimally close memoryless repair rates, but this 
does not seem to help the heuristics. The long tail can 
arise from certain failures such as low-noise maser warm 
up that takes a certain minimum time, e.g., 12 hours to 
recover from, when hot standbys for switching in are not 
provided. 


Finally, as explained, the (1 — t/U)+ multiplier term 
in Eq. (2) arises from the truncation of outage data at the 
end of a pass, where it was assumed that failures occur uni- 
formly over the duration (U = 9 hours) of a pass. It is this 
truncation which makes it hard to distinguish a negative 
exponential tail from a long flat tail. The three- parameter 
model has been adopted even though the heuristics are not 
perfect. 

VII. Summary 

It has been shown that the Discrepancy Reporting 
System in the DSN can be used to give good estimates of 
the probability of extended outages in spite of the discrep- 
ancy reports ending when the pass ends. The probability of 
a major outage (one extending beyond the end of a pass) 
is estimated by the best-fit model as around 5 percent. 
The model also gives good estimates for the probability of 
short, medium, and long outages. It is simple and yet fits 
very well. 
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Ka-Band MMIC Beam Steered Transmitter Array 

D. L. Rascoe, A. L. Riley, J. Huang, V. Lubecke, and L. DufFy 
Spacecraft Telecommunications Equipment Section 


A 32-GHz six-element linear transmitter array utilizing Monolithic Microwave 
Integrated Circuit (MMIC) phase shifters and power amplifiers has been designed 
and tested as part of the development of a spacecraft array feed for NASA deep-space 
communications applications. Measurements of the performance of individual phase 
shifters, power amplifiers, and microstrip radiators were carried out, and electronic 
beam steering of the linear array was demonstrated. 


I. Introduction 

Downlink communication systems for NASA deep- 
space missions presently operate at X-band (8.5 GHz). 
However, in the mid-1990s, advanced deep-space missions 
may utilize Ka-band systems (32 GHz downlink, 34 GHz 
uplink) to achieve communications enhancement on the or- 
der of 8 dB. A receiver Monolithic Microwave Integrated 
Circuit (MMIC) array at Ka band has been reported [1], 
but a transmitter array, a critical element in these systems, 
has not. 

At JPL, a 32-GHz solid-state transmitter is under de- 
velopment utilizing state-of-the-art GaAs MMIC devices. 
The initial goal of this work is to produce a Ka-band pla- 
nar phased array with 5-W output power to feed a 4-m 
reflector system. The electronic beam-scanning capability 
of the phased array will allow the narrow beam (« 0.2 
deg) of the large reflector to perform fine beam pointing 
toward the Earth (±1 deg). As a first step in this effort, 
a linear array composed of six subarrays of microstrip ra- 
diators, six MMIC switched-line phase shifters, and six 
MMIC power amplifiers was designed and tested. The 
MMIC devices were developed under programs funded by 


the NASA Lewis Research Center. This article reports 
the performance results and mathematical modeling of the 
JPL linear transmitter array incorporating these devices. 


II. MMIC Devices 

The MMIC phase shifters were designed and fabri- 
cated on GaAs by the Honeywell Sensors and Signal Pro- 
cessing Laboratory [2, 3]. A photograph of a phase shifter 
is shown in Fig. 1. A single phase shifter contains 12 
MESFETs in a three-bit switched-line configuration and 
2 MESFETs in an analog loaded-line configuration. The 
switched-line circuit provides nominal phase changes of 45, 
90, and 180 deg. The loaded-line design provides a nomi- 
nal continuous phase variation from 0 to 45 deg. These de- 
vices were originally designed for operation at 30 GHz for 
NASA’s ACTS program rather than at the slightly higher 
Deep Space Network frequency of 32 GHz. 

The MMIC power amplifiers were designed and fabri- 
cated by the Texas Instruments Central Research Labora- 
tory [4]. A photograph of an amplifier is shown in Fig. 2. 


207 



The devices used in the array were two-stage amplifiers 
utilizing GaAs MESFETs with a 0.25-/im gate length and 
gate widths of 0.1 mm and 0.3 mm for the first and second 
stages, respectively. The design goal of this device was to 
produce 20 dBm of output power. 


III. Array Design 

The linear phased array, shown in Fig. 3, is com- 
posed of four major “building blocks:” (1) a six-way mi- 
crostrip power divider, (2) a carrier strip of six MMIC 
phase shifters, (3) a carrier strip of six MMIC power am- 
plifiers, and (4) an antenna array of 6 pairs of microstrip 
patch radiators. All blocks were separately fabricated and 
tested to allow evaluation of the performance of each de- 
vice and for building a detailed model of the linear array. 
This “building-block” design approach of the array also 
allows for ease of replacement of any of the major compo- 
nents. Thus as improved MMIC devices become available 
or evaluation of different antenna elements is desired, the 
devices or elements may be readily integrated into this 
test-bed. 

The RF input to the array is through a WR-28 
waveguide-to-microstrip transition of the Van Heuven [5] 
antipodal finline type. The double-sided transition pat- 
tern is etched on 0.25-mm-thick Rogers 5880 Duroid and 
attached to the gold-plated aluminum housing using In- 
dalloy 121 solder. 

The six-way microstrip power divider consists of a set 
of 90-deg branch-line couplers with compensating lengths 
of microstrip interconnecting transmission lines to produce 
equal amplitude and phase at all six outputs. A close-up 
of the divider is shown in Fig. 4. The divider pattern was 
etched on 0.12-mm-thick Rogers 5880 Duroid laminated 
on an aluminum carrier. Each coupler was terminated by 
a 50-fi TaN thin-film chip resistor connected to an approx- 
imately quarter-wave-length radial line open-circuit stub. 
The power divider was designed using the EEsof Touch- 
stone simulation and MiCAD layout systems. A table of 
simulated versus measured data for all six ports at 32 GHz 
is shown in Table 1 . 

The MMIC carrier strips were designed to permit 
measurement of each MMIC device and to interface with 
the array. Individual devices were measured by attach- 
ing waveguide-to-microstrip transitions to the carrier as 
shown in Fig. 5. The carrier strips consisted of laser cut, 
0.25-mm-thick alumina substrates with etched TiW/gold 
circuit metalization that were soldered to gold-plated ko- 
var carrier strips. The dc and control lines were brought 


through the carrier and substrate via miniature coaxial 
feedthroughs mating on the underside with dual in-line 
pair (DIP) socket connectors. The MMICs were attached 
to the carriers with silver epoxy and wire bonded to the 
alumina substrates with 25-/im-diameter gold wire. 

The antenna array, consisting of six pairs of microstrip 
radiators that radiate in a direction normal to the plane 
of the dielectric, was designed using the multimode cavity 
theory [6]. This theory assumes that the electric field in 
the cavity underneath the patch can be precisely modeled 
by a series of cosine modal functions. Since the funda- 
mental mode, as well as the higher-order modes, can all 
be included in the analysis, accurate prediction of the res- 
onant frequency, copolarization and cross-polarization ra- 
diations, input impedance, and bandwidth effect can be 
achieved. 

The antenna array was etched on a 0.25-mm Rogers 
5880 Duroid substrate. Each etched radiator pair consists 
of two rectangular microstrip patches combined by a two- 
way reactive power combiner. A typical input return loss 
for a two-patch array, shown in Fig. 6, at 32 GHz was 
nominally -20 dB with 2.5 percent bandwidth. The sep- 
aration between radiator pairs was 1.08 free-space wave- 
lengths. This spacing was chosen to allow ease of assembly 
so that DIP sockets could be used at each MMIC position. 
Since only ±10 deg of beam scan is needed for the array 
to achieve the required reflector fine beam pointing of less 
than 1 deg, the relatively wide subarray spacing did not 
generate any serious grating lobe problems. Figure 7 is a 
calculated far-field pattern [7] of the 12-patch linear array 
with a 10-deg beam scan and 1.08- wavelength subarray 
spacing. 


IV. Test Results 

To ensure that the microstrip array gives adequate 
performance, the six pairs of patches were connected di- 
rectly to the six-way power divider without the amplifiers 
and phase shifters. The far-field pattern was measured 
and compared with the calculation as shown in Fig. 8. It 
demonstrated excellent pattern performance. The mea- 
sured array gain, excluding the power divider loss, was 
15.9 dBi, which indicates an efficiency of 83 percent. In- 
dividual measurements of single and double patches pre- 
ceded the 12-patch array measurement, and Table 2 shows 
a comparison of the radiator efficiencies in these three 
cases. 

The MMIC devices were measured on an extended 
HP8510 at 32 GHz in the carriers. Phase-shift measure- 
ments were referred to the nominal zero phase state of 
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each device and attenuation was referred to back-to-back 
waveguide-to-microstrip transitions. Figure 9 is a plot 
of nominal phase versus measured phase for six phase 
shifters. The devices used in the array were from two 
wafers, each made with slightly different mask sets. The 
mean and standard deviation of the phase shift was 7.0 ± 
1.6 percent over the range 0 to 300 deg. Attenuation as 
a function of nominal phase angle is shown in Fig. 10. 
The mean attenuation at zero phase shift is 10.4 dB for 
the first group and 8.0 dB for the second. The attenuation 
changed at a rate of approximately 0.8 dB/100 deg and 2.8 
dB/100 deg for the two groups. For phase settings larger 
than 250 deg, two units fail to track the other four, and 
their attenuations become larger. This large variation in 
attenuation performance is partially explained by the fact 
that these devices were originally designed for a lower fre- 
quency, 30 GHz, and for another application. Despite this 
variation in attenuation, however, reasonable array beam 
steering was obtained. 

The gain versus input power of the MMIC power am- 
plifiers is shown in Fig. 11. The six amplifiers were fabri- 
cated from the same wafer and have a low power-gain mean 
and standard deviation of 13.7 ± 1.7 dB. The mean and 
standard deviation of output power at 1-dB gain compres- 
sion is 13.6 ± 0.3 dBm. Several amplifiers were externally 
tuned and were found to be capable of producing as much 
as 21.5 dBm with an efficiency of 14.5 percent at 1-dB gain 
compression. 

The complete integrated linear phased array was 
tested by measuring its beam steering pattern. This pat- 
tern, shown in Fig. 12, was measured with a different tech- 
nique from the conventional method. Both the receive 
antenna, which was a 10-cm reflector, and the test trans- 
mit array were placed in fixed positions facing each other. 
Then the array’s phase shifters were adjusted to scan the 
beam at 1-deg intervals from —25 deg to 25 deg. Both the 


measurement system and the phase-shifter bias switching 
system were controlled by a PC. A program was written to 
set the phases based on keyboard entries of beam-steering 
angles. The pattern was then recorded with the received 
power versus the scan angles. Although the pattern was 
measured from -25 deg to 25 deg, our primary interest 
is in the ±10-deg region. Nevertheless, excellent compar- 
ison between the calculated and measured results can be 
observed in Fig. 12. The relatively high sidelobes were 
caused by improper operation of one of the middle am- 
plifier modules. The calculation included the effect of the 
failed module as well as randomized errors in both phase 
and amplitude. If all the six subarrays were functioning 
properly, the calculation predicted that the sidelobes are 
— 12 dB below the beam peak. The array had a beamwidth 
of 7.5 deg and demonstrated acceptable beam steering over 
±8 deg. 


V. Conclusions 

A 32-GHz six-element linear transmitter array utiliz- 
ing MMIC phase shifters and power amplifiers has been 
developed and tested as a precursor to the design of a 
two-dimensional array for use in a NASA spacecraft array 
feed. The switched-line phase shifters were accurate to 
within 7 percent on average and the power amplifier 1-dB 
compressed output power varied over 0.3 dB. The array 
had a beamwidth of 7.5 deg and demonstrated acceptable 
beam steering over ±8 deg. 

From the above results, it can be concluded that this 
MMIC phased array has adequate beam-scanning capa- 
bility for use in the two-dimensional array. The areas 
that need to be improved are the efficiency of the MMIC 
power amplifier and the insertion loss of the MMIC phase 
shifter. 
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Table 1. Simulated versus measured data for six- way power divider 


Out Port 




Simulated 




1 

2 

3 

4 

5 

6 

Avg. 

<T 

MAGS21 

-10.6 

-10.7 

-10.6 

-10.5 

-10.6 - 

-10.6 

-10.6 

0.1 

ANGS21 

-11.0 

-9.0 

-9.0 

-10.0 

-8.0 

-8.0 



adjANG 

0.0 

1.0 

1.0 

1.0 

2.0 

2.0 

1.0 

1.0 





Measured 




Out Port 

1 

2 

3 

4 

5 

6 

Avg . 

<J 

MAGS21 

-11.1 

-10.9 

-11.3 

-11.9 

-11.5 - 

-11.8 

-11.4 

0.4 

ANGS21 

71.0 

52.0 

77.0 

77.0 

54.0 

75.0 



adjANG 

0.0 

-19.0 

6.0 

6.0 

-17.0 

3.0 

-4.0 

11.0 




[Measured- 

•Simulated! 




Out Port 

1 

2 

3 

4 

5 

6 

Avg. 

a 

MAGS21 

-0.5 

-0.2 

-0.6 

-1.4 

-0.9 

-1.2 

-0.8 

0.4 

ANGS21 

0.0 

-20.0 

4.0 

5.0 

-20.0 

1.0 

-5.0 

11.0 


Table 2. Ka-band microstrip antenna gain and efficiency 


Parameter 

Single 

Patch 

Two-Patch 

Array 

12-Patch 

Array 

Calculated directivity, dBi 

7.3 

10.7 

16.7 

Measured gain, dBi 

6.9 

10.2 

15.9 

Efficiency, percent 

91 

89 

83 



Fig. 1. Honeywell MMIC phase shifter. 





Fig. 2. Texas Instruments MMIC power amplifier. 
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Fig. 5. MM 1C phase-shifter carrier strip between two waveguide-to-microstrip 

transitions. 


SCAN ANGLE, deg 


FREQUENCY. GHz 


Fig. 7. Calculated far-field pattern of the 12-patch 
array with a 10-deg beam scan. 


Fig. 6. Measured input return loss for a two-patch array. 
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Fig. 8. Comparison of calculated and measured far-field pattern for the linear 
array without MM 1C carrier strips. 
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NASA’s Search for Extraterrestrial Intelligence (SETI) project utilizes two 
complementary search strategies: a Sky Survey and a Targeted Search. The SETI 
team at the Jet Propulsion Laboratory (JPL) in Pasadena, California 7 has primary 
responsibility to develop and carry out the Sky Survey part. This article describes 
progress that has been made developing the major elements of the survey including 
a 2-million channel wideband spectrum analyzer system that is being designed and 
constructed by JPL for the Deep Space Network (DSN). The new system will be a 
multiuser instrument’ it will serve as a prototype for the SETI Sky Survey processor . 

This prototype system will be used to test the signal detection and observational 
strategies on DSN antennas in the near future. 


I. Introduction 

NASA's Search for Extraterrestrial Intelligence 
(SETI) microwave observing project is preparing to carry 
out two complementary search strategies: a Sky Survey 
and a Targeted Search. The NASA Ames Research Cen- 
ter has primary responsibility for developing and carrying 
out the Targeted Search. Similarly, the Jet Propulsion 
Laboratory (JPL) has primary responsibility to develop 
and carry out the Sky Survey. The two approaches are 


complementary in that the Targeted Search stresses sen- 
sitivity to detect either pulsed or continuous signals, and 
the Sky Survey gives up sensitivity in order to survey the 
99 percent of the sky that is not covered by the Targeted 
Search. 

The objective of the Sky Survey is to search the entire 
sky over the primary frequency range from 1.0 to 10.0 GHz 
for evidence of narrowband signals of extraterrestrial ori- 
gin. Frequency resolutions as narrow as 20 Hz will be used. 
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RELATIVE POWER, dB 


INPUT POWER, dBm 


Fig. 11. The spread in measured gain-versus-input power of six power 

amplifiers. 
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SCAN ANGLE, deg 

Fig. 12. Measured and calculated beam-scanning 
patterns for the transmitter array. 





Table 1 is a summary of the objectives of the survey. A 
significant technical challenge is to design an observational 
system that will meet maximum sensitivity requirements, 
use existing radio telescopes, and allow completion of the 
survey in a “reasonable” time span of 5 to 7 years. The 
basic strategy for conducting the survey is introduced in 
[1]. This article describes progress that has been made and 
gives a high-level description of the hardware and software 
of a prototype signal processing system that is currently 
under construction. 


II. Antenna Scan Strategy 

Several scan strategies were considered for the Sky 
Survey and parametric studies were conducted to help in 
the selection process. These studies clearly showed that 
the antenna beam must be actively scanned across the sky 
at a high rate if the survey is to be completed in the allot- 
ted time. The NASA SETI plan calls for the Sky Survey to 
use the 34-m DSN antennas. Because these antennas will 
not be dedicated to SETI, the scan strategy is quantized 
so that blocks of time as small as 6 hours can be scheduled 
and efficiently used. 

To accommodate the strategy, the celestial sphere is 
divided into 900 sky frames that are 60 deg wide by one deg 
high. The shape and size of the sky frame were selected to 
maximize the time spent scanning along the width of the 
sky frame and to minimize the time lost turning around 
at the edges. At the lower frequencies, where the antenna 
beamwidths are larger, it will be possible to observe a stack 
of several sky frames at one time; the resultant observed 
area of sky might be 10 deg high, for example. Careful 
scheduling will permit each sky frame to be observed when 
the width is nearly parallel with the horizon so that the 
antenna elevation angle will be nearly constant during each 
scan line. This is done to improve the uniformity of the 
system sensitivity within the sky frame. 

A scan pattern has been selected which can be de- 
scribed as the “racetrack.” The scan direction is oriented 
along a contour of nearly constant elevation and the order 
of progression of successive scans is chosen to minimize el- 
evation change as time progresses. Thus, the sky frame 
is observed from top to bottom as it rises in the east; the 
same sky frame is observed bottom to top as it sets in 
the west. A schematic diagram of the oval (“racetrack”) 
pattern is shown in Fig. 1. 

Details of this scan pattern have been derived so as to 
utilize the time efficiently, to maintain maximum system 
sensitivity with uniformity across the sky frame, and to 


facilitate the signal detection process. Finally, certain fea- 
tures of this pattern can be used to discriminate against 
some classes of radio frequency interference (RFI). This 
concept is explained in the following section. 


III. Receivers and Signal Processing 

The SETI Sky Survey requires an end-to-end sys- 
tem that includes large antennas equipped with microwave 
feeds, polarizers, amplifiers, and receivers that will span 
the entire frequency band from 1.0 to 10.0 GHz. Pre- 
liminary designs for these subsystems are being studied. 
However, the most challenging components of the sys- 
tem are the spectrum-analyzer and signal-detection sub- 
systems. Much of the challenge of the signal processor 
involves the impact of RFI, which is expected to be prob- 
lematic, but which is poorly characterized at this time. 
The final design of the Sky Survey processor system will 
benefit if a prototype system is tested in an observational 
environment that includes RFI. For this reason the SETI 
Team at JPL decided to build and test a prototype system. 

SETI is collaborating with the DSN Advanced Sys- 
tems Program to design and construct an engineering de- 
velopment model (EDM) of a wideband spectrum analyzer 
system to serve as a prototype for the SETI Sky Survey 
processor. A functional block diagram of the EDM system 
is shown in Fig 2. 

The principal features of the EDM system are as fol- 
lows: 

(1) The Wide Band Spectrum Analyzer (WBSA) is a 
2 21 channel FFT (Fast Fourier Transform) spectrum 
analyzer which covers an instantaneous bandpass of 
40 MHz. The WBSA will operate on the 40-MHz sig- 
nal at selectable resolutions of 20 Hz, 40 Hz, 80 Hz, 
160 Hz, and 320 Hz. Clock-down modes will be avail- 
able to increase each of these resolutions by factors 
of 8 or 64. Arbitrary time-domain windowing func- 
tions can be implemented in the hardware. Two inde- 
pendent accumulators are available so the system can 
function in the Dicke-switched mode commonly used 
for radio astronomy. Two 20-MHz wide channels of 
opposite polarization can also be implemented. 

(2) The Background Estimation Convolution and Thresh- 
old (BECAT) assembly is a hardware module that ap- 
plies the “along scan” detection algorithm described 
below. It uses convolution and order statistics to 
determine the detected power baseline in segments 
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across the passband and to apply a signal threshold 
that reduces the data rate by a factor of 10 3 or 10 4 . 

(3) The EDM software system manager is a network- 
based C language environment based on the ANSI C 
standard. It provides observatory control, data acqui- 
sition, and analysis and it designed to be portable to 
any system running Berkeley UNIX. 

The WBSA has been fully designed and is presently 
under construction. It operates at 4.5 GOPS (Giga oper- 
ations per second) and uses 20-MHz IEEE standard arith- 
metic units. The system contains 76 Mb of dynamic ran- 
dom access memory (DRAM). It is contained in a single 
electronics equipment rack and is controlled by a separate 
Masscomp 5600 computer. Specifications for the BECAT 
module are approaching completion; design work will be- 
gin in the near future. 

The EDM system hardware consists of 37 large-size 
(40 x 40 cm) circuit boards capable of holding 600 16-pin 
dual-in-line-packages (DIPs). Twenty boards are unique 
designs and 17 are copies. To date, 5 boards have been 
completely designed, fabricated and tested; 8 other boards 
are currently in process. A completed fixed-point Fast 
Fourier Transform (FFT) board is shown in Fig. 3. 

The EDM software system has been designed and a 
framework program is currently operating at JPL. The 
first of three successive software “builds” is scheduled for 
completion by the end of 1989. The functionality of the 
system is increased as each “build” is completed. 


IV. Sky Survey Signal Processing 

The instantaneous bandpass of the EDM is 40 MHz. 
Because the system gain profile will not be perfectly flat 
over the passband, the first step in the background estima- 
tion is to normalize the spectral data across the band. This 
is done by multiplying with an inverse gain matrix that is 
regularly updated during the calibrations performed as the 
antenna is turning from one scan to the next. 

The resulting normalized spectrum is then divided 
into subspectra whose bandwidths are optimized for “spec- 
tral flatness.” Each subspectrum is then examined to de- 
termine a pair of order statistics. The rith smallest order 
statistic is used to determine the mean noise temperature 
of the subspectrum and the r 2 th smallest is a test of the 
validity of the assumed probability distribution function. 
The algorithm leads to an excellent estimate of the noise 
power in each subspectrum. The estimate is also robust 


against strong RFI because it avoids using outliers of the 
probability distribution function. 

For the next stages of the EDM detection process, it 
is convenient to describe the steps as they are applied to a 
single frequency bin. These steps are schematically shown 
in Fig. 4. As the antenna beam scans across the sky at a 
rapid rate, the WBSA produces a complete 2 21 point spec- 
trum every 50 msec. For each frequency bin in the spec- 
trum, the WBSA hardware accumulates several spectra 
and passes the accumulated data on to the BECAT mod- 
ule (Step 2). The number of accumulated spectra and the 
antenna scan rate are chosen so that five independent accu- 
mulations are acquired during the time that the beam has 
moved one half-power beam width (HPBW) on the sky. In 
Step 3, the accumulated data samples for each frequency 
bin are convolved with a 5-point filter that matches the 
shape of the point-source response function (output power 
versus time) for the scanning antenna beam. The output is 
compared to the mean noise power (described above) and 
signals that exceed a preselected threshold are compared 
with RFI frequency “maps” resident in the system. Those 
which do not match frequency patterns of previously de- 
tected RFI are stored on disk and passed on to the signal 
detection software (Step 4). 

Next, one considers the inter-scan tests applied in the 
EDM software. The algorithm is based on the underly- 
ing assumption that putative extraterrestrial signals will 
be originating so far from earth that they will be spa- 
tially fixed on the sky; that is, they will match the side- 
real motion of the stars. Therefore, a signal with sufficient 
strength will appear on two or more adjacent scan lines, 
and the lines will appear at the same place along the scan 
line because the sky frames are produced after the scan 
line data are compensated for sidereal motion. This fea- 
ture makes interstellar signals distinguishable from RFI 
and random noise, which will generally not move with 
sidereal rate and direction. A schematic description of 
this process is shown in Fig. 5. 

Another useful discriminator against RFI is the ratio 
test, which also utilizes the features of adjacent scan lines 
as well as knowledge of the beam shape in the direction 
orthogonal to the scan line. For this test we identify a 
candidate signal, then look for a companion signal on the 
adjacent scan lines. If the candidate signal is strong, the 
companion signal (s) should appear with the proper ratio 
unless the signal is RFI or has disappeared in the time 
interval between scans. The effectiveness of the ratio test 
diminishes as candidate signals become weak relative to 
the system noise. 


220 


The BECAT module is designed to provide up to 8192 
reports during each spectrum accumulation. This corre- 
sponds to a data rate of 800 Kbytes per sec, which can be 
compared with the 84,000 Kbytes per second data stream 
from the WBSA accumulator. Each report consists of four 
numbers, as follows: 

(1) The first frequency of a contiguous run of bins that 
exceeds threshold 

(2) The number of bins in the contiguous run 

(3) The integrated (convolved) power contained within 
the run 

(4) The maximum (unconvolved) power in a single bin of 
the run 

The first three quantities characterize the signal that 
appeared above threshold. The fourth quantity is used to 
determine if the detection was due to a signal that was 
present throughout the five accumulations that match the 
convolutional filter, or if a single accumulation contained a 
burst with sufficient intensity to exceed threshold even af- 
ter being diluted by the four other accumulations. Events 
which fail this test cannot be summarily dismissed as RFI, 
however, because they could be caused by a pulsed signal. 
These events will be identified for follow-up observations. 


V. Duration of the Sky Survey 

The time required to complete the Sky Survey de- 
pends upon several factors that affect the scan rate and 
the number of scans that must be made. Some of these 
parameters have already been selected: the survey will 
span the frequency range from 1 to 10 GHz, and it will 
be carried out with 34-m parabolic antennas. Preliminary 
designs of microwave receiving systems indicate that the 
instantaneous bandwidth for the 6 lowest frequency bands 
will be 160 MHz, and the bandwidth for the remaining 25 
higher frequency bands will be 320 MHz. 

The spacing of the adjacent scan tracks in the sky 
frame affects the spatial uniformity of the survey sensi- 
tivity. The nominal spacing is 1/2 HPBW. As the HPBW 
decreases with frequency, the number of scan lines for each 
sky frame will increase linearly with frequency. 


The one remaining free parameter is the angular scan 
rate of the antenna. The tradeoff is between sensitivity and 
survey duration; slow scan rates improve the sensitivity 
and extend the time required to complete the survey. The 
JPL studies show that a reasonable compromise is to scan 
at 0.2 deg per sec and to schedule about 30 sec to complete 
the turnaround between scans. With these rates, one can 
compute the time required to scan the sky for each of the 
31 frequency bands that span the 1-10 GHz range. The 
result is shown in Fig. 6. Note that the scan time increases 
almost a factor of 10 from the lowest to highest frequency 
surveys. 

The right-hand ordinate in Fig. 6 displays the cumu- 
lative scan time as successively higher frequency surveys 
are included in the ensemble. For our scan rate of 0.2 
deg per sec, the complete survey will require 725 days, 
or about 2 years of continuous round-the-clock observa- 
tions. Of course, this means that the real survey will take 
3 or 4 times as long when one considers that the antenna 
may only be scheduled one-third of the time for the sur- 
vey and that even scheduled time must be allocated for 
maintenance, weather, and follow-up observations of “in- 
teresting” signals. Thus the observing schedule must be 
adhered to in order to complete the survey in the allot- 
ted time. The alternative would likely be to reduce the 
frequency range of the survey. 


VI. Conclusions 

The EDM system is scheduled to undergo validation 
tests at Goldstone in 1990. These include the following: 

(1) Testing and refinement of the signal detection algo- 
rithms 

(2) Testing of alternative scan strategies and observatory 
control functions 

(3) Testing of RFI rejection techniques 

Results from these tests need to be fed back into the 
design of the full-scale Sky Survey processor, which will be 
built in the early 1990s. 
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Table 1. Objectives of the NASA SETI sky survey 


Parameters 

Objectives 

Spatial coverage 

Entire celestial sphere 

Frequency range 

1.0 to 10.0 GHz inclusive and 
higher spot bands 

Duration 

« 6 years 

Frequency resolution 

ss 30 Hz 

Instantaneous bandpass 

ss 300 MHz 

Sensitivity 

< io _ 2 V/( GHz ) Wm “ 2 

Polarization 

Simultaneous dual circular 

Signals 

Primarily CW with natural radio 
astronomy fallout 



Fig. 1. Sky survey oval scan pattern. 
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HARDWARE 



Fig. 2. Prototype system functional diagram. 
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Fig. 3. Engineering Development Model (EDM) of survey processor: 
a fixed-point FFT board. 
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SKY SURVEY SCAN LINES 
DIRECTION OF TELESCOPE MOTION 

(a) 

OOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO — ► 


(b) 



• = ACCUMULATION POINT 

Fig. 4. Signal processing in prototype hardware: (a) high- 
speed sampling; (b) several spectra are co-added (ac- 
cumulated) by the WBSA; (c) a 5-point convolutional 
filter is applied by the BECAT; (d) data above BECAT 
threshold are passed to the signal-detection software. 
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SKY SURVEY SCAN LINES 
DIRECTION OF TELESCOPE MOTION 



• = ACCUMULATION POINT ABOVE BECAT THRESHOLD 

Fig. 5. Signal processing in prototype hardware. The 
detection software compares candidate points aligned 
on adjacent spans. 



FREQUENCY, GHz 


Fig. 6. Time required to scan the sky in 31 adjacent 
frequency bands. 


CUMULATIVE SCAN TIME, days 
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